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PREFACE 


As suggested by the title, this book attempts to present in an 
objective, systematic manner the primary, or fundamental, molar 
principles of behavior. It has been written on the assumption that 
all behavior, indi'vddual and social, moral and immoral, normal and 
psj'chopathic, is generated from the same primary laws; that the 
differences in the objective behavioral manifestations are due to 
the differing conditions under which habits are set up and function. 
Consequently the present work may be regarded as a general intro- 
duction to the theory of all the behavioral (social) sciences. 

In an effort^ to insure its intelligibility to all educated readers, 
the complicated equations and other more technical considerations 
have been relegated to terminal notes, where they may be found 
by the technically trained who care to consult them. The formal 
verbal statements of the primary principles are presented in special 
type at the ends of the chapters in which they emerge from the 
analysis. A convenient glossary of the various symbols employed 
is pro\dded. 

There remains the pleasant duty of recording my numerous 
obligations. First in order of irnportance is my gratitude to the 
Institute of Human Relations and to its Director, Mark A. May, 
for the leisure and the generous provision of innumerable accessory 
facilities which have made possible the preparation of this work, 
and for the stimulation and instruction received during several 
years of Monday night Institute staff meetings. Contributing to 
the same end have been the stimulation, criticism, and suggestions 
given by the members of my seminar in psychology in the Yale 
Graduate School. Few things have been so pleasant and so profit- 
able scientifically as the contacts with the brilliant and vigorous 
young personalities encountered in these situations. The most of 
whatever is novel in the pages of this book has been generated in 
one way or another from these contacts. 

To certain individuals a more specific debt of gratitude must 
be acknowledged. All the original figures were drawn by Joy 
Richardson and Don Olson. Bengt Carlson fitted equations to the 
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Bumerous empirical curves, and is responsible for all of the more 
technical mathematical material which appears in the terminal 
notes. Maivdn J. Herbert- prepared the subject index. To my 
laboratory research assistants of the last ten years, Walter C. 
Shipley, St. Clair A. Switzer, Milton J. Bass, Eliot H. Rodnick, 
Carl Iver Havland, Douglas G. Ellson, Richard Bugelski, Glen L. 
Heathers, Peter Arakelian, Chester J. Hill, John L. Finan, Stanley 
B. Williams, C. Theodore Perin, Richard 0. Rouse, Charles B. 
WocKibury, and Ruth Hays, I am indebted for the conscientious per- 
formance of numerous experiments which were especially planned 
for this work, and which naturally make up a considerable propor- 
tion of the empirical material employed. The loyal cooperation and 
kindly day fay day suggestions and criticisms of these splendid 
young people have made that phase of the task a rarely satisfying 
labor. 

Another and smaller group of individuals have given invalu- 
able aid in the preparation of the manuscript. Eleanor Jack 
Gibson read much of an early version of the manuscript and made 
helpful suggestions. Irvin L. Child read the manuscript in a late 
stage of revision and made numerous valuable criticisms and sug- 
gestions, To Kenneth L. Spence I owe a debt of gratitude which 
cannot adequately be indicated in this place; from the time when 
the ideas here put forward were in the process of incubation in 
my graduate seminar and later when the present work was being 
plaimed, on tihrou^ its many revisions, Dr. Spence has contributed 
generously and effectively with su^estions and criticisms, large 
of which have been utilized without indication of their 
origin. Finally, to Ruth Hays I am deeply indebted for the tran- 
of hundreds of pages of unbelievably illegible handwrit- 
for ihe preparation of the name index, and for absolutely 
indispenmble a^stance with the formal aspects of the manuscript.. 

C. L. H. 


New Havm 
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CHAPTER I 


The Nature of Scientific Theory 

This book is the beginning of an attempt to sketch a systematic 
objective theory of the behavior of higher organisms. It is accord- 
ingly important at the outset to secure a clear notion of the essential 
nature of systematic theory in science, the relation of theory to 
other scientific activities, and its general scientific status and 
importance. 

THE TWO ASPECTS OF SCIENCE! EMPIEICAL AN1> EXPLANATOEY 

Men are ever engaged in the dual activity of making observa- 
tions and then seeking explanations of the resulting revelations. 
All normal men in all times have observed the rising and setting 
of the sun and the several phases of the moon. The more thought- 
ful among them have then proceeded to ask the question, ^‘Why? 
Why does the moon wax and wane? Why does the sun rise and 
set, and where does it go when it sets?^^ Here we have the two 
essential elements of modem science; the making of observations 
constitutes the empirical or factual component, and the systematic 
attempt to explain these facts constitutes the theoretical com- 
ponent. As science has developed, specialization, or division of 
labor, has occurred; some men have devoted their time mainly to 
the making of observations, while a smaller number have occupied 
themselves largely with the problems of explanation. 

During the infancy of science, observations are for the mojst 
part casual and qualitative — ^the sim rises, beats down strongly at 
midday, and sets; the moon grows from the crescent to full and 
then diminishes. Later observations, usually motivated by prac- 
tical considerations of one kind or another, tend to become quan- 
titative and precise — ^the number of days in the moon’s monthly 
cycle are counted accurately, and the duration of the sun’s yearly 
course is determined with precision. As the need for more exact 
observations increases, special tools and instruments, such as gradu- 
ated measuring sticks, protractors, clocks, telescopes, and micro- 
scopes, are devised to facilitate the labor. Kindred tools relating 
to a given field of science are frequently assembled imder a single 

1 



2 PRINCIPLES OF BEHAVIOR 

roof for convenience of use; such an assemblage becomes a labora- 
torv”. 

As scientific investigations become more and more searching 
it is discovered that the spontaneous happenings of nature are not 
adequate to permit the necessary observations. This leads to the 
setting up of special conditions which will bring about the desired 
events under circumstances favorable for such observations; thus 
experiments originate. But even in deliberate experiment it is 
often extraordinarily diflScult to determine with which among a 
complex of antecedent conditions a given consequence is primarily 
associated; in this way arise a complex maze of control experiments 
and other technical procedures, the general principles of which are 
common to all sciences but the details of which are peculiar to 
each. Thus in brief review we see the characteristic technical 
development of the empirical or factual aspect of science. 

Complex and difficult as are some of the problems of empirical 
science, th(^e of scientific theory are perhaps even more difficult 
of solution and are subject to a greater hazard of error. It is not 
a matter of chance that the waxing and waning of the moon was 
oteerv-ed for countless millennia before the comparatively recent 
times when it was at last successfully explained on the basis of the 
Copemican hypoth^a. Closely paralleling the development of the 
technical aids employed by empirical science, there have also grown 
up in the field of scientific theory a complex array of tools and 
sj^ial prcKjedures, mostly mathematical and logical in nature, de- 
signed to aid in coping with th^ peculiar difficulties. Because of 
the elementary nature of the present treatise, very little explicit 
di^ussion of the use of such tools will be given. 

TBM I>H>XJCTIVE KATURB OF BdENTTElC THEORY AND 
EXPLANATION 

The team ttmory in the behavioral or “social” sciences has a 
variety of current meanings. As understood in the present work, 
a theory is a systematic d^uctive derivation of the secondary 
principlei of ot^rvable phenomena from a relatively small number 
of primary principle or postulates, much as the secondary prin- 
€iplm or th^M^ns of geometry are all ultimately derived as a logical 
Meraicfay item a few original definitions and primary principles 
axioms. In science an observed event is said to be explained 
wim ihe pmpemUon «q>r^ing it has been logically derived from 
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a set of definitions and postulates coupled with certain observed 
conditions antecedent to the event. This, in brief, is the nature 
of scientific theory and explanation as generally understood and 
accepted in the physical sciences after centuries of successful devel- 
opment (i, pp. 495-496) . 

The preceding summary statement of the nature of scientific 
theory and explanation needs considerable elaboration and exem- 
plification. Unfortunately the finding of generally intelligible 
examples presents serious difficulties; because of the extreme youth 
of systematic beha\dor theory (i, p. 501 ff.; 2 , p. 15 ff.) as here 
understood, it is impossible safely to assume that the reader pos- 
sesses any considerable familiarity with it. For this reason it will 
be necessary to choose all the examples from such physical sciences 
as are now commonly taught in the schools. 

We can best begin the detailed consideration of the nature of 
scientific explanation by distinguishing it from something often 
confused with it. Suppose a naive person with a moderate-sized 
telescope has observed Venus, Mars, Jupiter, and Saturn, together 
with numerous moons (including our own), and found them all 
to be round in contour and presumably spherical in form. He 
might proceed to formulate his observations in a statement such as, 
'^All heavenly bodies are spherical,” even though this statement 
goes far beyond the observations, since he has examined only a 
small sample of these bodies. Suppose, next, he secures a better 
telescope; he is now able to observe Uranus and Neptune, and finds 
both round in contour also. He may, in a manner of speaking, be 
said to explain the sphericity of Neptune by subsuming it under 
the category of heavenly bodies and then applying his previous 
empirical generalization. Indeed, he could have predicted the 
spherical nature of Neptune by this procedure before it was observed 
at all: 

All heavenly bodies are spherical. 

Neptune is a heavenly body, 

Therefore Neptime is spherical. 

Much of what is loosely called explanation in the field of be- 
havior is of this nature. The fighting propensities of a chicken 
are explained by the fact that he is a game cock and game cocks 
are empirically known to be pugnacious. The gregariousness of a 
group of animals is explained by the fact that the animals in ques- 
tion are dogs, and dogs are empirically known to be gregarious. 
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As we have seen, it is possible to make concrete predictions of a 
sort on the basis of such generalizations, and so they have signifi- 
cance. Nevertheless this kind of procedure — ^the subsumption of a 
particular set of conditions xmder a category involved in a pre- 
viously made empirical generalization — is not exactly what is 
regarded here as a scientific theoretical explanation. 

For one thing, a theoretical explanation as here understood 
grows out of a problem, e.g., “What must be the shape of the 
heavenly bodies?” Secondly, it sets out from certain propositions 
or statements. These propositions are of two rather different kinds. 
Propositions of the first type required by an explanation are those 
stating the relevant initial or antecedent conditions. For example, 
an explanation of the shape of heavenly bodies might require the 
preliminary a^umption of the existence of (1) a large mass of 
(2) more or less plastic, (3) more or less homogeneous matter, 
(4) initially of any shape at all, (5) the whole located in otherwise 
empty space. But a statement of the antecedent conditions is not 
enough; there must also be available a set of statements of general 
principles or rules of action relevant to the situation. Moreover, 
the particular principles to be utilized in a given explanation must 
be chosen from the set of principles generally employed by the 
theorist in explanations of this class of phenomena, the choice to 

made strictly on the basis of the nature of the question or 
problem under consideration taken in conjunction with the ob- 
served or assumed conditions. For example, in the case of the 
shai:^ of the heavenly bodies the chief principle employed is the 
Newtonian law of gravitation, namely, that every particle of matter 
attracts every other particle to a degree proportional to the product 
of their mass^ and inversely proportional to the square of the dis- 
tance separating them. These principles are apt themselves to 

verbal formulations of empirical generalizations, but may be 
merely happy conjectures or guesses found by a certain amount of 
antecedent trial-and-error to agree with observed fact. At all 
e\ent& they originate in one way or another in empirical observa- 
tion. 

The concluding phase of a scientific explanation is the deriva- 
Um of the smw^ to the motivating question from the conditions 
md the principles, taken Jointly, by a process of inference or rea- 
soning. For example, it follows from the principle of gravitation 
riist empty which might at any time have existed within 

tile ma^ of a heavenly body would at once be closed. Moreover, 
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if at any point on the surface there were an elevation and adjacent 
to it a depression or valley, the sum of the gravitational pressures 
of the particles of matter in the elevation acting on the plastic 
material beneath would exert substantially the same pressure later- 
ally as toward the center of gravity. But since there would be no 
equal lateral pressure originating in the valley to oppose the pres- 
sure originating in the elevation, the matter contained in the 
elevation would flow into the valley, thus eliminating both. This 
means that in the course of time all the matter in the mass under 
consideration would be arranged about its center of gravity with 
no elevations or depressions; i.e., the radius of the body at all 
points would be the same. In other words, if the assumed mass w^ere 
not already spherical it would in the course of time automatically 
become so (4, p* 424) . It follows that all heavenly bodies, includ- 
ing Neptune, must be spherical in form. 

The significance of the existence of these two methods of 
arriving at a verbal formulation of the shape of the planet Neptune 
may now be stated. The critical characteristic of scientific theo- 
retical explanation is that it reaches independently through a 
process of reasoning the same outcome with respect to (secondary) 
principles as is attained through the process of empirical general- 
ization. Thus scientific theory may arrive at the general proposi- 
tion, ^^All heavenly bodies of sufficient size, density, plasticity, and 
homogeneity are spherical,” as a theorem, simply by means of a 
process of inference or deduction without any moons or planets 
having been observed at all. The fact that, in certain fields at 
least, practically the same statements or propositions can be at- 
tained quite independently by empirical methods as by theoretical 
procedures is of enormous importance for the development of 
science. For one thing, it makes possible the checking of results 
obtained by one method against those obtained by the other. It is 
a general assumption in scientific methodology that if everything 
entering into both procedures is correct, the statements yielded by 
them will never be in genuine conflict. 

SCIENTIFIC EXPLANATIONS TEND TO COME IN CLUSTERS 
CONSTITUTING A LOGICAL HIERARCHY 

This brings us to the important question of what happens in 
a theoretical situation when one or more of the supposed ante- 
cedent conditions are changed, even a little. For example, when 
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considering the theoretical shape of heavenly bodies, instead of the 
mass being completely fluid it might he assumed to be only slightly 
plastic. It is evident at once, depending on the degree of plasticity, 
the size of the mass, etc., that there may be considerable deviation 
from perfect sphericity, such as the irregularities observable on the 
surface of our own planet. Or suppose that we introduce the addi- 
tional condition that the planet revolves on its axis. This neces- 
sarily implies the entrance into the situation of the principle of 
centrifugal force, the familiar fact that any heavy object whirled 
around in a circle will pull outward. From this, in conjunction with 
other principles, it may be reasoned (and Newton did so reason) 
that the otherwise spherical body would bulge at the equator; 
moreover, this bulging at the equator together with the principle 
of gravity would, in turn, cause a flattening at the poles (4 p. 424). 
Thus we see how it is that as antecedent conditions are varied the 
theoretical outcome (theorem) following from these conditions will 
also vary. By progressively varying the antecedent conditions in 
this way an indefinitely large number of theorems may be derived, 
but all from the very same group of basic principles. The prin- 
ciples are employed over and over in different combinations, one 
combination for each theorem. Any given principle may accord- 
ingly be employed many times, each time in a different context. 
In this way it comes about that scientific theoretical systems po- 
tentially have a very large number of theorems (secondary prin- 
ciple) but relatively few general (primary) principles. 

We note, next, that in scientific systems there are not only many 
tiheoreos derived by a process of reasoning from the same assem- 
blage of general principle, but these theorems take the form of a 
logical hierarchy: first-order theorems are derived directly from the 
origmal general principle; second-order theorems are derived with 
ttie aid of the first-order theorems; and so on in ascending hierarchi- 
cal orders. Thus in deducing the flattening of the planets at the 
Newton anploy^ the logically antecedent principle of cen- 
trifugal force which, while an easily observable phenomenon, can 
iimli be d^ucal, and so was deduced by Newton, from the condi- 
ticms of circular motion. The principle of centrifugal force accord- 
ingjy is m example of a lower-order theorem in Newtou'S theo- 
retecal system {4, p- 40 ff.). On the other hand, Newton derived 
fit®! bulpng of the earth at its equator what is known as the 
^pr^^on of the ^uinoxes’^ U, p. 580), the fact that the length 
of ttie determined by tte time elapsing from one occasion 
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when the shadow cast by the winter sun at noon is longest to the 
next such occasion, is shorter by some twenty minutes than the 
length of the year as determined by noting the time elapsing from 
the conjunction of the rising of the sun with a given constellation 
of stars to the next such conjunction. This striking phenomenon, 
discovered by Hipparchus in the second century b.c., "was first ex- 
plained by Newton. The precession of the equinoxes accordingly 
is an example of a higher-order theorem in the Newtonian theo- 
retical system. 

From the foregoing it is evident that in its deductive nature 
systematic scientific theory closely resembles mathematics. In this 
connection the reader may profitably recall his study of geometry 
with (1) its definitions, e.g., point, line, surface, etc., (2) its primary 
principles (axioms), e.g., that but one straight line can be drawn 
between two points, etc., and following these (3) the ingenious and 
meticulous step-by-step development of the proof of one theorem 
after the other, the later theorems depending on the earlier ones in 
a magnificent and ever-mounting hierarchy of derived propositions. 
Proper scientific theoretical systems conform exactly to all three of 
these characteristics.^ For example, Isaac Newton’s Principia (4), 
the classical scientific theoretical system of the past, sets out with 
(1) seven definitions concerned with such notions as matter, mo- 
tion, etc., and (2) a set of postulates consisting of his three famous 
laws of motion, from which is derived (3) a hieraichy of seventy- 
three formally proved theorems together with large numbers of 
appended corollaries. The theorems and corollaries are concerned 
with such concrete observable phenomena as centrifugal force, the 
shape of the planets, the precession of the equinoxes, the orbits of 
the planets, the flowing of the tides, and so on. 

SCIENTIFIC THEORY IS NOT ARGUMENTATION 

The essential characteristics of scientific theory may be further 
clarified by contrasting it with argumentation and even with 
geometry. It is true that scientific theory and argument have 
similar formal or deductive structures; when ideally complete both 
should have their terms defined, their primary principles stated, 

^The formal structure of scientific theory differs in certain respects from 
that of pure mathematics, but these differences need not be elaborated here; 
the point to be emphasized is that mathematics and scientific theory are 
alike in that they are both strictly deductive in their natures. 
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and tLeir conclusions derived in an explicit and logical manner. In 
spite of this superficial similarity, however, the two differ radically 
in their essential natures, and it would be difficult to make a more 
serious mistake than to confuse them. Because of the widespread 
tendency to just this confusion, the distinction must be stressed. 
An important clue to the understanding of the critical differences 
involved is found in the objectives of the two processes. 

The 'jmmary objective of argumentation is persuasion. It is 
socially aggressive; one person is deliberately seeking to influence 
or coerce another by means of a process of reasoning. There is 
thus in alimentation a proponent and a recipient. On the surface 
the proponents objective often appears to be nothing more than to 
iuduce the recipient to assent to some more or less abstract proposi- 
tion. Underneath, however, the ultimate objective is usually to 
lead the recipient to some kind of action, not infrequently such as 
to be of advantage to the proponent or some group with which the 
proponent is allied. Now, for the effort involved in elaborate argu- 
mentation to have any point, the proposition representing the objec- 
tive of the proponent's efforts must be of such a nature that it 
cannot be substantiated by direct observation. The recipient can- 
not have made such observations; otherwise he would not need to 
be convinced. 

Moreover, for an argument to have any coerciveness, the 
recipient must^ believe that the definitions and the other basic 
assumptions of the argument are sound; the whole procedure is that 
of systematically transferring to the final culminating conclusion 
the assent which the recipient initially gives to these antecedent 
statemmts. In this connection it is to be noted that systems of 
phil(^phy, metaphysics, theology, etc., are in the above sense at 
bottom elaborate arguments or attempts at persuasion, since their 
conclusions are of such a nature that they cannot possibly be estab- 
lished by direct observation. Consider, for example, Proposition 
XIV of Part One of Spinoza’s Ethic (5 ) : 

^^B^des God no substance can be, . . .** 

TTie primary objective of scientific theory, on the other hand, 
is tile establishment of scientific principles. Whereas argumenta- 
ticm is soriaUy a^ressive and is directed at some other person, 
natiiral sei^ce theory is aggressive towards the problems of nature, 
it uses lo^c as a tool primarily for mediating to the scientist 
Mmself a more perfect understanding of natural processes. If New- 
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ton had been a scientific Robinson Crusoe, forever cut off from 
social contacts, he would have needed to go through exactly the 
same lo^cal processes as he did, if he were himself to have under- 
stood why the heavenly bodies are spherical rather than cubical. 
Naturally also, argumentation presupposes that the proponent has 
the solution of the question at issue fully in hand; hence his fre- 
quent overconfidence, aggressiveness, and dogmatism. In contrast 
to this, the theoretical activities of science, no less than its em- 
pirical acthdties, are directed modestly toward the gradual, piece- 
meal, successive-approximation establishment of scientific truths. 
In a word, scientific theory is a technique of investigation, of seek- 
ing from nature the answers to questions motivating the investi- 
gator; it is only incidentally and secondarily a technique of per- 
suasion. It should never descend to the level of mere verbal 
fencing so characteristic of metaphysical controversy and argu- 
mentation. 

Some forms of argumentation, such as philosophical and meta- 
physical speculation, have often been supposed to attain certainty 
of their conclusions because of the “self-evident^^ nature of their 
primary or basic principles. This is probably due to the influence 
of Euclid, who believed his axioms to be “self-evident truths.” At 
the present time mathematicians and logicians have largely aban- 
doned intuition or self-evidentiality as a criterion of basic or any 
other kind of truth. Similarly, scientific theory recognizes no 
axiomatic or self-evident truths; it has postulates but no axioms in 
the Euclidian sense. Not only this; scientific theory differs sharply 
from argumentation in that its postulates are not necessarily sup- 
posed to be true at all. In fact, scientific theory largely inverts 
the procedure found in argument: whereas argument reaches belief 
in its theorems became of antecedent belief in its postulates^ soLen-- 
tific theory reaches belief in its postulates to a considerable extent 
through direct or observational evidence of the soundness of its 
theorems (^, p. 7). 

THEORETICAL ANI) EMPIRICAL PROCEDURES CONTRIBUTE 
JOINTLY TO THE SAME SCIENTIFIC END 

No doubt the statement that scientific theory attains belief in 
its postulates through belief in the soundness of its theorems will 
come as a distinct surprise to many persons, and for several reasons. 
For one thing, the thoughtful individual may wonder why, in spite 
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of the admitted absence of self-evident principles or axioms, the 
basic principles of scientific systems are not firmly established at 
the outset by means of observation and experiment. After such 
establishment, it might be supposed, the remaining theorems of 
the system could all be derived by an easy logical procedure with- 
out the laborious empirical checking of each, as is the scientific 
practice. Despite the seductive charm of its simplicity this 
methodology is, alas, impossible. One reason is, as already pointed 
out (p. 3), that the generalizations made from empirical investi- 
gations can never be quite certain. Thus, as regards the purely 
empirical process every heavenly body so far observed might be 
spherical, yet this fact would only increase the probability that the 
next one encountered would be spherical j it would not make it 
certain. The situation is exactly analogous to that of the con- 
tinued drawing of marbles at random from an urn containing 
white marbles and suspected of containing black ones also. As 
one white marble after another is drawn in an unbroken succession 
the probability increases that the next one drawn will be white, but 
there can never come a time when there will not be a margin of 
uncertainty. On the ham of observation alone to say that all 
heavenly bodies are spherical is as unwarranted as it would be to 
state positively that all the marbles in an urn must be white be- 
cause a limited random sampling has been found uniformly to be 
white. 

But even for the sampling of empirical theory or experimental 
truth to be effective, the sampling of the different situations in- 
volved must be tally random. This means that the generalization 
in question must be tried out empirically with all kinds of ante- 
cedent conditions; which implies that it must be tested in conjunc- 
tion with tae operation of the greatest variety of other principles, 
imgly and in thdr various combinations. In very simple situations 
the ^ieniist in ^arch of primary principles needs to do little more 
tiian formulate his observations. For example, it is simple enough 
to ob^rve the falling of stones and similar heavy objects, and 
eroa to note that such objects descend more rapidly the longer the 
time daf^d sin^ tiiey were released from rest. But the moment 
two or more major principles are active in the same situation, the 
tadk of <tetarminmg the role played by the one under investigation 
far mcare difficult. It is not at all obvious to ordinary 
that the principle of gravity operative in the behavior 
ai fr^y faffing bodies is the same as that operative in the behavior 
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of the common pendulum; ordinary falling bodies, for example, do 
not manifest the phenomena of lateral oscillation. The relation of 
gravity to the behavior of the pendulum becomes evident only as 
the result of a fairly sophisticated mathematical analysis requiring 
the genius of a Galileo for its initial formulation. But this 
"mathematical analysis,” be it noted, is full-fledged scientific theory 
with boTKi fids theorems such as; the longer the suspension of the 
pendulum, the slower the beat. 

In general it may be said that the greater the number of addi- 
tional principles operative in conjunction with the one under inves- 
tigation, the more complex the theoretical procedures which are 
necessary. It is a much more complicated procedure to show theo- 
retically that pendulums should beat more slowly at the equator 
than at the pole than it is to deduce that pendulums with long 
suspensions should beat more slowly than those with short ones. 
This is because in the former situation there must be taken explicitly 
into consideration the additional principle of the centrifugal force 
due to the rotation of the earth about its axis. 

At the outset of empirical generalization it is often impossible 
to detect and identify the active scientific principle by mere obser- 
vation. For example, Newton’s principle that all objects attract 
each other inversely as the squares of the distances separating them 
was a daring conjecture and one extending much beyond anything 
directly observable in the behavior of ordinary falling bodies. It 
is also characteristic that the empirical verification of this epoch- 
making principle was first secured through the study of careful 
astronomical measurements rather than through the observation 
of small falling objects. But the action of gravity in determining 
the orbits of the planets is even less obvious to ordinary unaided 
observation than is its role in the determination of the behavior 
of the pendulum. Indeed, this can be detected only , by means of 
the mathematics of the ellipse, i.e., through a decidedly sophisticated 
theoretical procedure, one which had to await the genius of Newton 
for its discovery. 

Earlier in the chapter it was pointed out that the theoretical 
outcome, or theorem, derived from a statement of supposed ante- 
cedait conditions is assumed in science always to agree with the 
empirical outcome, provided both procedures have been correctly 
performed. We must now note the further assumption that if there 
is disagreement between the two outcomes there must be something 
wrong wilh at least one of the principles or rules involved in the 
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derivation of the theorem; empirical observations are regarded as 
primary, and wherever a generalization really conflicts with obser- 
vation the generalization must always* give way. When the break- 
down of a generalization occurs in this way, an event of frequent 
occurrence in new fields, the postulates involved are revised if 
possible so as to conform to the known facts. Following this, de- 
ductions as to the outcome of situations involving still other com- 
binations of principles are made; these in their turn are checked 
against observations; and so on as long as disagreements continue 
to occur. Thus the determination of scientific principles is in con- 
siderable part a matter of symbolic trial-and-error. At each trial 
of this process, where the antecedent conditions are such as to in- 
volve jointly several other presumptive scientific principles, sym- 
bolic or theoretical procedure is necessary in order that the investi- 
gator may know the kind of outcome to be expected if the supposed 
principle specially under investigation is really acting as assumed. 
The empirical procedure is necessary in order to determine whether 
the antecedent conditions were really followed by the deductively 
expected outcome. Thus both theoretical and empirical procedures 
are indispensable to the attainment of the major scientific goal — 
that of the determination of scientific principles. 

HOW THE EMPIRICAL VERIFICATIOH OF THEOREMS IHDIRECTLY 
STJBSTAHTIATES POSTULATES 

But how can the empirical verification of the implications of 
theorems derived from a set of postulates establish the truth of the 
postulates? In seeking an answer to this question we must note 
at the ouiset that absolute truth is not thus established. The con- 
ciusicHi reached in science is not that the postulates employed in 
the derivation of the empirically verified theorem are thereby shown 
to be true beyond doubt, but rather that the empirical verification 
of the theorem has increased the probability that the next theorem 
dOT'rai from these postulates in conjunction with a different set 
of ant^j^ent conditions will also agree with relevant empirical 
determinatioffls. And this conclusion is arrived at on the basis of 
ehan<^ or probability, i.e., on the basis of a theory of sampling. 

The nature of this sampling theory may be best explained by 
of a decidedly artificial example. Suppose that by some 
nuracle a scieatast should come into possession of a set of postulates 
nemo oi whidh had ever been employed, but which were believed 
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to satisfy the logical criterion of yielding large numbers of em- 
pirically testable theorems; that a very large number of such 
theorems should be deducible by special automatic logical calcula- 
tion machines; that the theorems, each sealed in a neat capsule, 
should all be turned over to the scientist at once; and, finally, that 
these theorems should then be placed in a box, thoroughly mixed, 
drawn out one at a time, and compared with empirical fact. As- 
suming that no failures of agreement occurred in a long succession 
of such comparisons, it would be proper to say that each 'succeeding 
agreement would increase the probability that the next drawing 
from the box would also result in an agreement, exactly as each 
successive uninterrupted drawing of white marbles from an urn 
would increase progressively the probability that the next drawing 
would also yield a white marble. But just as the probability of 
drawing a white marble will always lack something of certainty 
even with the best conceivable score, so the validation of scientific 
principles by this procedure must always lack something of being 
complete. Theoretical ^^truth” thus appears in the last analysis to 
be a matter of greater or less probability. It is consoling to know 
that this probability frequently becomes very high indeed [S, p. 6) . 

THE ^^TRUTH” status OF LOGICAL PRINCIPLES OR RULES 

Despite much belief to the contrary, it seems likely that logical 
(mathematical) principles are essentially the same in their mode 
of validation as scientific principles; they appear to be merely 
invented rules of symbolic manipulation w^hich have been found by 
trial in a great variety of situations to mediate the deduction of 
existential sequels verified by observation. Thus logic in science 
is conceived to be primarily a tool or instrument useful for the 
derivation of dependable expectations regarding the outcome of 
dynamic situations. Except for occasional chance successes, it 
requires sound rules of deduction, as well as sound dynamic postu- 
lates, to produce sound theorems. By the same token, each obser- 
vationally confirmed theorem increases the justified confidence in 
the logical rules which mediated the deduction, as well as in the 
“empirical” postulates themselves. The rules of logic are more 
dependable, and consequently less subject to question, presumably 
because they have survived a much longer and more exacting period 
of iarial than is the case with most scientific postulates. Probably 
it is because of the widespread and relatively imquestioned ac- 
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ceptanee of the ordinary logical assumptions, and because they 
come to each individual investigator ready-made and usually with- 
out any appended history, that logical principles are so frequently 
regarded with a kind of religious awe as a subtle distillation of the 
human spirit; that they are regarded as never having been, and as 
never to be, subjected to the tests of validity usually applicable to 
ordinary scientific principles; in short, that they are strictly “self- 
evident” truths [S, p. 7). As a kind of empirical confirmation of 
the above view as to the nature of logical principles, it may be noted 
that both mathematicians and logicians are at the present time 
busily inventing;^ modifying, and generally perfecting the principles 
or rules of their disciplines {6). 

SUMMARY 

Modem science has two inseparable components — ^the empirical 
and the theoretical. The empirical component is concerned pri- 
marily with observation; the theoretical component is concerned 
with the interpretation and explanation of observation. A natural 
event is explained when it can be derived as a theorem by a process 
of reasoning from (1) a knowledge of the- relevant natural condi- 
tions antedating it, and (2) one or more relevant principles called 
peculates. Clusters or families of theorems are generated, and 
theorems are often employed in the derivation of other theorems; 
thus is developed a logical hierarchy resembling that found in 
ordinary geometry. A hierarchy of interrelated families of theo- 
rems, all derived from the same set of consistent postulates, con- 
stitute a scientific system. 

Scientific theory resembles argumentation in being logical in 
nainre but differs radically in that the objective of argument is to 
tmvinee. In i^ientific theory logic is employed in conjunction 
with ol^awation as a means of inquiry. Indeed, theoretical pro- 
^iur^ are indispensable in the establishment of natural laws. The 
range of validity of a given supposed law can be determined only 
by trying it out mpiricaliy under a wide range of conditions where 
it will operate in mmultaneous conjunction with the greatest variety 
and cmnbmaticm of other natural laws. But the only way the 
scim^^ trfl frmn tibe outcome of such an empirical procedure 
a hypothetical law has acted in the postulated 

m fir^ to deduce by a logical process what the outcome 
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of the investigation shovld be if the hypothesis really holds. This 
deductive process is the essence of scientific theory. 

The topical procedure in science is to adopt a postulate tenta- 
tively, deduce one or more of its logical implications concerning 
observable phenomena, and then check the validity of the deduc- 
tions by observation. If the deduction is in genuine disagreement 
with observation, the postulate must be either abandoned or so 
modified that it implies no such conflicting statement. If, however, 
the deductions and the observations agree, the postulate gains in 
dependability. By successive agreements under a very wide variety 
of conditions it may attain a high degree of justified credibility, but 
never absolute certainty. 
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CHAPTER II 


Introduction to an Objective Theory of Behavior 

Having examined the general nature of scientific theory, we 
must now proceed to the elaboration of an objective theory as 
applied specifically to the behavior of organisms {10 ) . Preliminary 
to this great and complex task it will be well to consider a number 
of the more general characteristics of organismic behavior, as well 
as certain difficulties which will be encountered and hazards which 
ought to be avoided. 

THU BASIC FACT OF ENVmONMENTAL-ORGAJS’ISMIC INTERACTION 

At the outset of the independent life of an organism there 
begins a dynamic relationship between the organism and its environ- 
ment. For the most part, both environment and organism are 
active; the environment acts on the organism, and the organism 
acts on the environment (5, p, 2) . Naturally the terminal phase of 
any given environmental-organismic interaction depends upon the 
activity of each; rarely or never can the activity of either be pre- 
dicted from knowing the behavior characteristics of one alone. 
The possibility of predicting the outcome of such interaction de- 
pends upon the fact that both environment and organism are part 
of nature, and as such the activity of each takes place according 
to known rules, i.e., natural laws. 

The environment of an organism may conveniently be divided 
into two portions — ^the internal and the external. The external 
aivironment may usefully be subdivided into the inanimate en- 
vironmait and the animate or organismic environment. 

The laws of the internal environment are, for the most part, 
of the physiology of the particular organism. The laws of 
the inanimate environment are those of the physical world and 
eonsfetute the critical portions of the physical sciences; they are 
relatively simple and reasonably well known. 

The laws of the organismic environment are those of the be- 
havior of other organisms, especially organisms of the same species 
m the one under consideration; they make up the primary prin- 
ciple of the tehavior, or ''social,” sciences and are comparatively 

16 



17 


NATURE OF OBJECTIVE BEHAVIOR THEORY 

complex. Perhaps because of this complexity they are not as yet 
very well understood. Since in a true or symmetrical social situa- 
tion only organisms of the same species are involved, the basic laws 
of the activities of the environment must be the same as those of 
the organism under consideration. It thus comes about that the 
objective of the 'present work is the elaboration of the basic molar ^ 
behavioral lams underl'ying the “social” sciences, 

ORGAKISMIC NEED, ACnVITY, AND SURVIVAL 

Since the publication by Charles Darwin of the Origin of Species 
{^) it has been necessary to think of organisms against a back- 
ground of organic evolution and to consider both organismic struc- 
ture and function in terms of survival. Survival, of course, applies 
equally to the individual organism and to the species. Physio- 
logical studies have shown that survival requires special circum- 
stances in considerable variety; these include optimal conditions of 
air, w^ater, food, temperature, intactness of bodily tissue, and so 
forth; for species survival among the higher vertebrates there is 
required at least the occasional presence and specialized reciprocal 
behavior of a mate. 

On the other hand, when any of the commodities or conditions 
necessary for individual or species survival are lacking, or when 
they deviate materially from the optimum, a state of primary need 
is said to exist. In a large proportion of such situations the need 
will be reduced or eliminated only through the action on the en- 
vironment of a particular sequence of movements made by the 
organism. For example, the environment will, as a rule, yield a 
commodity (such as food) which will mediate the abolition of a 
state of need (such as hunger) only when the movement sequence 
corresponds rather exactly to the momentary state of the environ- 
ment; i.e., when the movement sequence is closely synchronized 
with the several phases of the environmental reactions. If it is to 
be successful, the behavior of a hungry cat in pursuit of a mouse 
must vary from instant to instant, depending upon the movements 
of the mouse. Similarly if the mouse is to escape the cat, its move- 

^By this expression is meant the uniformities discoverable among the 
grossly observable phenomena of behavior as contrasted with the laws of the 
behavior of the ultimate “molecules” upon which this behavior depends, 
such as the constituent cells of nerve, muscle, gland, and so forth. The term 
molar thus means coarse or macroscopic as contrasted with molecular, or 
microscopic. 
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ments must vary from instant to instant, depending upon the move- 
ments of the cat. 

Moreover, in a given external environment situation the be- 
havior must often differ radically from one occasion to another, 
depending on the need which chances to be dominant at the time; 
e.g., whether it be of food, water, or a mate. In a similar manner 
the behavior must frequently differ widely from one environmental 
situation to another, even when the need is exactly the Same in each 
environment; a hungry man lost in a forest must execute a very 
different sequence of movements to relieve his need from what 
would be necessary if he were in his home. 

It follows from the above considerations that an organism mil 
hardly swrvive unless the state of organismic need and the state of 
the environment in its relation to the organism are somehow jointly 
and dmyltaneoicsly brought to hear upon the movement-producing 
mechanism of the organism, 

THE OBGAN-IC BASIS OF ADAPTIVE BEHAVIOR 

All normal higher organisms possess a great assortment of 
muscles, usually with bony accessories. These motor organs are 
ordinarily adequate to mediate the reduction of most needs, pro- 
vided their contractions occur in the right amount, combination, and 
sequence. The momentary status of most portions of the environ- 
ment with respect to the organism is mediated to the organism by 
an immense number of specialized receptors which respond to a 
(N}nsiderable variety of energies such as light waves (vision) , soimd 
waves (hearing), gas^ (smell), chemical solutions (taste), me- 
chanical impacts (touch), and so on. The state of the organism 
it^f (the internal environment) is mediated by another highly 
specially seri^ of receptors. It is probable that the various 
ecmditions of need also fall into this latter category; i.e., in one 
way or another needs activate more or less characteristic receptor 
much as do external environmental forces. 

Neur^ impul^ set in motion by the action of these receptors 
p^m along separate nerve fibers to the central ganglia of the 
n^vous system, notably the brain. The brain, which acts as a 
kimi of aukmahc switchboard, together with the remainder of the 
^sten, routes and distributes the impulses to indi- 
TOiu^ uncles and ^ands in rather precisely graded amoimts and 
Whm tl^ neural impul^ reaches an effector organ 
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(muscle or gland) the organ ordinarily becomes active, the amount 
of activity usually varjung with the magnitude of the impulse. 
The movements thus brought about usually result in the elimination 
of the need, though often only after numerous unsuccessful trials. 
But organismic activity is by no means always successful ; not in- 
frequently death occurs before an adequate action sequence has 
been evoked. 

It is the primary task of a molar science of behavior to isolate 
the basic laws or rules according to which various combinations of 
stimulation, arising from the state of need on the one hand and 
the state of the environment on the other, bring about the kind of 
behavior characteristic of different organisms. A closely related 
task is to understand why the behavior so mediated is so generally 
adaptive, i.e., successful in the sense of reducing needs and facili- 
tating survival, and why it is xmsuccessful on those occasions when 
suridval is not facilitated. 


THE NEUROLOGICAL tTJRSUS THE MOLAR APPROACH 

From the foregoing considerations it might appear that the 
science of behavior must at bottom be a study of physiology. In- 
deed, it was once almost universally believed that the science of 
behavior must wait for its useful .elaboration upon the development 
of the subsidiary science of neurophysiology. Partly as a result of 
tliis belief, an immense amount of research has been directed to 
the understanding of the detailed or molecular dynamic laws of 
this remarkable automatic structure. A great deal has been re- 
vealed by these researches and the rate of development is constantly 
being accelerated by the discovery of new and more effective 
methods of investigation. Nearly all serious students of behavior 
like to believe that some day the major neurological laws will be 
known in a form adequate to constitute the foundation principles 
of a science of behavior. 

In spite of these heartening successes, the gap between the 
minute anatomical and physiological account of the nervous system 
as at present known and what would be required for the construc- 
tion of a reasonably adequate theory of molar behavior is im- 
passable. The problem confronting the behavior theorist is sub- 
stantially like that which would have been faced by Galileo and 
Newton had they seriously considered delaying their preliminary 
formulation of the molar mechanics of the physical world until the 
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micro-mechanics of the atomic and subatomic world had been 
satisfactorily elaborated. 

Students of the social sciences are presented with the dilemma 
of waiting until the physico-chemical problems of neurophysiology 
have been adequately solved before beginning the elaboration of 
behavior theory, or of proceeding in a provisional manner with 
certain reasonably stable principles of the coarse, macroscopic or 
molar action of the nervous system whereby movements are evoked 
by stimuli, particularly as related to the history of the individual 
organism. 

There can hardly be any doubt that a theory of molar behavior 
founded upon an adequate knowledge of both molecular and molar 
principles would in general be more satisfactory than one founded 
upon molar considerations alone. But here again the history of 
physical science is suggestive. Owing to the fact that Galileo and 
Newton carried out their molar investigations, the world has had 
the use of a theory which was in very close approximation to obser- 
vations at the molar level for nearly three hundred years before 
the development of the molecular science of modern relativity and 
quantum theory. Moreover, it is to be remembered that science 
proceeds by a series of successive approximations; it may very well 
be that had Newton^s system not been worked out when it was 
there would have been no Einstein and no Planck, no relativity and 
no quantum theory at all. It is conceivable that the elaboration 
of a systematic science of behavior at a molar level may aid in the 
development of an adequate neurophysiology and thus lead in the 
end to a truly molecular theory of behavior firmly based on 
physiology. 

It happens that a goodly number of quasi-neurological principles 
have now been determined by careful experiments designed to trace 
out the relationship of the molar behavior of organisms, usually 
as int^ratoi whol^, to well-controlled stimulus situations. Many 
of the more promising of these principles were roughly isolated in 
the first instance by the Russian physiologist, Pavlov, and his 
pupils, by means of conditioned-reflex experiments on dogs. More 
rc^^tly extensive experiments in many laboratories in this country 
with all kinds of ructions on a wide variety of organisms, includ- 
ing 23 aan, have greatly extended and rectified these principles and 
dwwn how tiiey operate jointly in fhe production of the more 
comply fom^ of brfiavior. Because of the pressing nature of 
WiaTOir pitrfjlems, both practical and theoretical aspects of be- 
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havior science are, upon the whole, being developed according to the 
second of the two alternatives outlined above. For these reasons 
the molar approach is employed in the present work. 

In this connection it is to be noted carefully that the alternatives 
of microscopic versus macroscopic, and molecular versus molar, are 
relative rather than absolute. In short, there are degrees of the 
molar, depending on the coarseness of the ultimate causal segments 
or units dealt with. Other things equal, it would seem wisest to 
keep the causal segments small, to approach the molecular, the 
fine and exact substructural details, just as closely as the knowledge 
of that substructure renders possible. There is much reason to 
believe that the seeming disagreements among current students of 
beha\’ior may be largely due to the difference in the degree of the 
molar at which the several investigators are working. Such dif- 
ferences, however, do not represent fundamental disagreements. 
In the end the work of all who differ only in this sense may find 
a place in a single systematic structure, the postulates or primary 
assumptions of those working at a more molar level ultimately 
appearing as theorems of those working at a more molecular level. 

THE ROLE OF INTER\TNING VARIABLES IN BEHAVIOR THEORY 

Wherever an attempt is made to penetrate the invisible world 
of the molecular, scientists frequently and usefully employ logical 
constructs, intervening variables, or symbols to facilitate their 
thinking. These symbols or represent entities or processes 
which, if existent, would accoimt for certain events in the observ- 
able molar world. Examples of such postulated entities in the field 
of the physical sciences are electrons, protons, positrons, etc. A 
closely parallel concept in the field of behavior familiar to everyone 
is that of habit as distinguished from habitual action. The habit 
presumably exists as an invisible condition of the nervous system 
quite as much when it is not mediating action as when habitual 
action is occurring; the habits upon which swimming is based are 
just as truly existent when a person is on the dance floor as when 
he is in the water. 

In some cases there may be employed in scientific theory a 
whole series of hypothetical unobserved entities; such a series is 
presented by the hierarchy of postulated physical entities: mole- 
cule, atom, and electron, the molecule supposedly being constituted 
of atoms and the atom in its turn being constituted of electrons. 
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A rough parallel to this chain of hypothetical entities from the 
physical sciences will be encountered in the present system of be- 
havior theory. For the above reasons the subject of symbolic con- 
structs, intervening variables, or hypothetical entities which are not 
directly observable requires comment (6, p. 3 ff.) . 

Despite the great value of logical constructs or intervening 
variables in scientific theory, their use is attended with certain 
difiBculties and even hazards. At bottom this is because the pres- 
ence and amount of such hypothetical factors must always be 
determined indirectly. But once (1) the dynamic relationship 
existing between the amount of the hypothetical entity (Z) and 
some antecedent determining condition (A) which can be directly 
observ^ed, and (2) the dynamic relationship of the hypothetical 
entity to some third consequent phenomenon or event (B) which 
also can be directly observed, become fairly well known, the scien- 
tific hazard largely disappears. The situation in question is repre- 
sented in Figure 1. When a hypothetical dynamic entity, or even 

A 

Fig. 1 . Diagrammatic reprei^ntation of a relatively simple case of an in- 
tairening variable (X) not directly observable but functionally related (/) to 
tiie antecedent event (A) and to the consequent event (R), both A and B 
being directly observable. When an intervening variable is thus securely 
anchored to observables on both sides it can be safely employed in scientific 
theory. 

a chain of such entities each fimctionally related to the one logically 
pr^eding and following it, is thus securely anchored on both sides 
to observable and measurable conditions or events (A and B), the 
main theoretical danger vanishes. This at bottom is because under 
the assum^ circumstances no ambiguity can exist as to when, and 
how much of, B should follow A. 

THB OBJICTEVB VERSUS THE SUBJECTIVE APPROACH TO 
BEHAVIOR THEORY 

If the circumstances sketched above as surrounding and safe- 
guarding the of hypothetical entities are not observed, the 
fallacies may be committed. The painfully slow path 
wha:eby man ha^ as of yesterday, begun to emerge into the truly 
scimlific is litter^ with such blunders, often tragic in their 
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practical consequences. A pestilence or a hurricane descends upon 
a \dllage and decimates the population. The usual hypothesis put 
forward by primitive man (and many others who think themselves 
not at all primitive) to explain the tragic event (B) is that some 
hypothetical spirit (X) has been angered by the violation (A) of 
some tribal taboo on the part of one or more inhabitants of the 
village. Unfortunately this mode of thinking is deeply ingrained 
in most cultures, not excepting our own, and it even crops up under 
various disguises in what purports to be serious scientific -work. 

Perhaps as good an example of such a fallacious use of the 
intervening variable as is offered by recent scientific history is that 
of the mtelechy put forward hf Hans Driesch as the central con- 
cept in his theory of vitalism (5). Driesch says, for example: 

A supreme mind, conversant with the inorganic facts of nature and 
knowing all the intensive manifoldness of all entelechies and psychoids . . . 
would be able to predict the mdividual history of the latter, would be able 
to predict the actions of any psychoid with absolute certainty. HuTuan 
mind, on the other hand, is not able to predict in this way, as it does not 
know entelechy before its manif^tation, and as the material conditions 
of life, which alone the mind of man am Imow ... in its completeness, are 
not the only conditions responsible for organic phenomena. [S, p. 24:9.) 

Driesch^s entelechy (X) fails as a logical construct or intervening 
variable not because it is not directly observable (though of course 
it is not) , but because the general functional relationship to ante- 
cedent condition A and that to consequent condition B are both 
left unspecified. This, of course, is but another way of saying that 
the entelechy and all similar constructs are essentially metaphysical 
in nature. As such they have no place in science. Science has no 
use for unverifiable hypotheses. 

A logically minded person, unacquainted with the unscientific 
foibles of those who affect the scientific virtues, may naturally 
wonder how such a formulation could ever mediate a semblance of 
theoretical prediction and thus attain any credence as a genuinely 
scientific theory. The answer seems to lie in the inveterate animis- 
tic or anthropomorphic tendencies of human nature. The entelechy 
is in substance a spirit or daemon, a kind of vicarious ghost. The 
person employing the entelechy in effect says to himself, ^Tf I were 
the entelechy in such and such a biological emergency, what would 
I do?” Emowing the situation and what is required to meet the 
emergency, he simply states what he knows to be required as a 
solution, and he at once has in this statement what purports to be 
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a scientific deduction! He has inadvertently substituted himself 
in place of the construct and naively substituted his knowledge of 
the situation for the objective rules stating the functional rela- 
tionships which ought to subsist between A and X on the one hand, 
and between X and B on the other. 

This surreptitious substitution and acceptance of one's knowl- 
edge of what needs to be done in a biological emergency for a 
theoretical deduction is the essence of what we shall call anthro- 
pomorphism, or the subjective, in behavior theory. After many 
centuries the physical sciences have largely banished the subjective 
from their fields, but for various reasons this is far less easy of 
accomplishment and is far less well advanced in the field of be- 
havior. The only known cure for this unfortunate tendency to which 
ail men are more or less subject is a grim and inflexible insistence 
that all deductions take place according to the explicitly formulated 
rules stating the functional relationships of A to X and of X to B. 
This latter is the essence of the scientifically objective. A genu- 
inely scientific theory no more needs the anthropomorphic intuitions 
of the theorist to eke out the deduction of its implications than an 
automatic calculating machine needs the intuitions of the operator 
in the determination of a quotient, once the keys representing the 
dividend and the divisor have been depressed. 

Objective scientific theory is necessary because only under ob- 
jective conditions can a principle be tested for soundness by means 
of observation. The basic difficulty with anthropomorphic sub- 
j^tivism is that what appear to be deductions derived from such 
formulations do not originate in rules stating postulated functional 
relationships, but rather in the intuitions of the confused thinker. 
Ol^ervational check of such pseudo-deductions may verify or refute 
iJi^e intuitions, but has no bearing on the soundness of any scien- 
tific principles whatever; such verifications or refutations might 
properly increa^ the reputation for accurate prophecy of the one 
making mieh intuitive judgments, but a prophet is not a principle, 
much le^ a scientific theory, 

OBJECTIVISM VERSUS TELEOLOGY 

Even a superficial study of higher organisms shows that their 
behavior occurs in cycles. The rise of either a primary or a sec- 
^dary nmi normally marks the beginning of a behavior cycle, 
mkA ilm abolition, or sul^tantial reduction of that need marks its 
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end. Some phase of the joint state of affairs resulting from the 
environmental-organismic interaction at the end of a behavior cycle 
is customarily spoken of as a goal. Our usual thoughtless custom 
is to speak of cycles of behavior by merely naming their outcome, 
effect, or end result, and practically to ignore the various move- 
ments which brought this terminal state about. Guthrie has ex- 
pressed this tendency more aptly than anyone else { 4 , p. 1). We 
say quite naturally that a man catches a fish, a woman bakes a 
cake, an artist paints a picture, a general wuns a battle. The end 
result of each angling exploit, for example, may be in some sense 
the same but the actual movements involved are perhaps never 
exactly the same on any two occasions; indeed, neither the angler 
nor perhaps anyone else knows or could know in their ultimate 
detail exactly what movements were made. It is thus inevitable 
that for purposes of communication we designate behavior se- 
quences by their, goals. 

Now for certain rough practical purposes the custom of naming 
action sequences by their goals is completely justified by its con- 
venience. It may even be that for very gross molar behavior it 
can usefully be employed in theory construction, provided the 
theorist is alert to the naturally attendant hazards. These appear 
the moment the theorist ventures to draw upon his intuition for 
statements concerning the behavior (movements) executed by the 
organism between the onset of a need and its termination through 
organismic action. Pseudo-deductions on the basis of intuition bom 
of intimate knowledge are so easy and so natural that the tendency 
to make them is almost irresistible to most persons. The practice 
does no harm if the theorist does not mistake this subjective intui- 
tional performance for a logical deduction from an objective theory, 
and attribute the success of his intuitions to the validity of the 
theoretical principles. 

An ideally adequate theory even of so-called purposive behavior 
ought, therefore, to begin with colorless movement and mere 
receptor impulses as such, and from these build up step by step 
both adaptive behavior and maladaptive behavior. The present* 
approach does not deny the molar reality of purposive acts (as 
opposed to movement), of intelligence, of insight, of goals, of 
intents, of strivings, or of value; on the contrary, we insist upon 
the genuineness of these forms of behavior. We hope ultimately to 
show the logical right to the use of such concepts by deducing them 
as secondary principles from more elementary objective primary 
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principles. Once they have been derived we shall not only under- 
stand them better but be able to use them with more detailed 
effectiveness, particularly in the deduction of the movements which 
mediate (or fail to mediate) goal attainment, than would be the 
case if we had accepted teleological sequences at the outset as 
gross, unanalyzed (and unanalysable) wholes. 

“emeegentism” a doctrine or despair 

Perhaps the very natural and economical mode of communica- 
tion whereby we speak of the terminal or goal phases of action, 
largely regardless of the antecedent movements involved, predis- 
poses us to a belief in teleology. In its extreme form teleology is 
the name of the belief that the terminal stage of certain environ- 
mental-organismic interaction cycles somehow is at the same time 
one of the antecedent determining conditions which, bring the be- 
havior cycle about. This approach, in the case of a purposive 
behavior situation not hitherto known to the theorist, involves a 
kind of logical circularity: to deduce the outcome of any be- 
havioral situation in the sense of the deductive predictions here 
under consideration, it is necessary to know all the relevant ante- 
cedent conditions, but these cannot be determined until the be- 
havioral outcome has been deduced. In effect this means that the 
task of deduction cannot begin tmtil after it is completed! 
Naturally this leaves the theorist completely helpless. It is not 
surprising that the doctrine of teleology leads to theoretical despair 
and to such pseudo-remedies as vitalism and emergentism. 

Emeigentism, as applied to organismic behavior, is the name 
for the view that in the process of evolution there has "emerged” 
a form of behavior which is ultimately unanalysable into logically 
more primitive elements— behavior which cannot possibly be de- 
duced from any logically prior principles whatever. In particular 
it is held that what is called goal or purposive behavior is of such 
a nature, that it cannot be derived from any conceivable set of 
postulates involving mere stimuli and mere movement (S, pp. 7-8; 
7 , K>. 26-27). 

On the other hand, many feel that this defeatist attitude is not 
(ally unwholescune in that it discourages scientific endeavor, but 
tiiat it is quite unjustified by the facts. The present writer shares 
view. Tharefore a serious attempt will ultimately be made to 
draw that Ih^e EupposwUy nnpo®ible derivations are actually pos- 
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sible; in some cases they will be shown to be quite easy of accom- 
plishment, 

A SUGGESTED PROPHYLAXIS AGAINST ANTHROPOMdRPHIC 
SUBJECTIVISM 

As already suggested, one of the greatest obstacles to the 
attainment of a genuine theory of behavior is anthropomorphic 
subjectmsm. At bottom this is because we ourselves are so inti- 
mately involved in the problem; we are so close to it that it is 
diiEcult to attain adequate perspective. For the reader who has 
not hitherto struggled with the complex but fascinating problems 
of behavior theory, it will be hard to realize the difficulty of main- 
taining a consistently objective point of view. Even when fully 
aware of the nature of anthropomorphic subjectivism and its dan- 
gers, the most careful and experienced thinker is likely to find 
himself a victim to its seductions. Indeed, despite the most con- 
scientious effort to avoid this it is altogether probable that there 
may be found in various parts of the present work hidden elements 
of the anthropomorphically subjective. 

One aid to the attainment of behavioral objectivity is to think 
in terms of the behavior of subhuman organisms, such as chim- 
panzees, monkeys, dogs, cats, and albino rats. Unfortunately this 
form of prophylaxis against subjectmsm all too often breaks down 
when the theorist begins thinking what he would do if he were a 
rat, a cat, or a chimpanzee; when that happens, all his knowledge 
of his own behavior, bom of years -of self-observation, at once 
begins to function in place of the objectively stated general rules 
or principles which are the proper substance of science. 

A device much employed by the author has proved itself to be 
a far more effective prophylaxis. This is to regard, from time to 
time, the behaving organism as a completely self-maintaining robot, 
constructed of materials as unlike ourselves as may be. In doing 
this it is not necessary to attempt the solution of the detailed 
engineering problems connected with the design of such a creature. 
It is a wholesome and revealing exercise, however, to consider the 
various general problems in behavior dynamics which must be 
solved in the design of a truly self-maintaining robot. We, in 
common with other mammals, perform innumerable behavior adap- 
tations with such ease that it is apt never to occur to us that any 
problem of explanation exists concerning them. In many such 
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seemingly simple activities lie dynamical problems of very great 
complexity and diflScuIty. 

A second and closely related subjective tendency against whicli 
the robot concept is likely to prove effectively prophylactic is that 
to the reification of a behavior function. To reify a function is to 
give it a name and presently to consider that the name represents 
a thing, and finally to believe that the thing so named somehow 
explains the performance of the function. We have already seen 
an example of this unfortunate tendency in Driesch's entelechy. 
The temptation to introduce an entelechy, soul, spirit, or daemon 
into a robot is slight; it is relatively easy to realize that the intro- 
duction of an entelechy would not really solve the problem of design 
"of a robot because there would still remain the problem of designing 
the entelechy itself^ which is the core of the original problem all 
over again. The robot approach thus aids us in avoiding the very 
natural but childish tendency to choose easy though false solutions 
to our problems, by removing all excuses for not facing them 
squarely and without evasion. 

Unfortunately it is possible at present to promise an explanation 
of only a portion of the problems encountered in the infinitely 
complex subject of organismic behavior. Indeed, it is no great 
exaggeration to say that the present state of behavior theory re- 
sembles one of those pieces of sculpture which present in the main 
a rough, unworked block of stone with only a hand emerging in 
low relief here, a foot or thigh barely discernible there, and else- 
where a part of a face. The undeveloped state of the behavior 
sciences suggested by this analogy is a source of regret to the 
behavior theorist but not one of chagrin, because incompleteness 
is characteristic even of the most advanced of all theoretical 
sciences. From this point of view the difference between the physi- 
cal and the behavioral sciences is one not of kind but of degree — 
of the relative amount of the figure still embedded in the unhewn 
rock. There is reason to believe that the relative backwardness 
of toe behavior sciences is due not so much to their inherent com- 
plexity as to the difiSculty of maintaining a consistent and rigorous 
obj^tiviam. 

SUMMARY 

The fidd of l^havior theory centers primarily in the detailed 
intoracMen of organism and environment. The basic principles of 
^^ganianic behavior are to be viewed against a background of 
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organic evolution, the success or failure of the evolutionary process 
being gauged in terms of survival. Individual and species survival 
depend upon numerous optimal physiological conditions; when one 
of these critical conditions deviates much from the optimum, a 
state of primary need arises. Need reduction usually comes about 
through a particular movement sequence on the part of the or- 
ganism. Such sequences depend for their success jointly upon the 
nature of the need and the nature and state of the environment. 

The condition of organismic need and the status of the environ- 
ment evoke from specialized receptors neural impulses which are 
brought to bear jointly on the motor organs by the central ganglia 
of the nervous system acting as an automatic switchboard. The 
primary" problem of behavior theory is to discover the laws accord- 
ing to which this extraordinarily complex process occurs. Students 
of behavior have resorted to the coarse, or “molar,” laws of neural 
activity as revealed by conditioned-reflex and related experiments, 
rather than to the “molecular” results of neurophysiology, because 
the latter are not yet adequate. 

Perhaps partly as the result of this molar approach, it is found 
necessarv' to introduce into behavior theory numerous logical con- 
structs analogous to molecules and atoms long used in the physical 
sciences. Ail logical constructs present grave theoretical hazards 
when they are not securely anchored to directly observable events 
both as antecedents and consequences by definite functional rela- 
tionships. Under conditions of unstated functional relationships 
the naive theorist is tempted to make predictions on the basis of 
intuition, which is anthropomorphic subjectivism. The derivation 
of theoretical expectations from explicitly stated functional rela- 
tionships is the objective method. Experimental agreement with 
expectations can properly validate theoretical principles only when 
objective procedures are employed. 

Some writers believe that there is an impassable theoretical 
gulf between mere muscle contraction and the attainment of goals; 
that the latter are “emergents.” This doctrine of despair grows 
naturally out of the doctrine of teleology. The present treatise 
accepts neither teleology nor its pessimistic corollary. Goals, in- 
tents, intelligence, insight, and value are regarded not only as 
genuine but as of the first imi>ortance. Ultimately an attempt will 
be made to derive all of these things objectively as secondary 
phenomena from more elementary objective conditions, concepts, 
and principles. 
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NOTES 

Operational Definitions and Intervening Variables 

In 19S8 Bridgman, a physicist whose chief research activities have been con- 
cerned with the empirical determination of various physical phenomena under 
very great pressures, wrote a book (1) in which he made an acute examination 
of the use of various concepts in current physical theory, particularly those 
representing intervening variables. The cure which he reco m mended for such 
abuses as he found was the scrupulous recognition of ‘the operations carried out 
by the experimentalists as a means to the making of the observations and measure- 
ments of the observable events (A and B, Figure 1). This, as we saw above, has 
^)ecial significance for the science of behavior, which is so prone to the subjective 
u^ of intervening variables. Quite naturally and properly, Bridgman’s work 
has gready impressed many psychologists. Unfortunately his emphasis upon 
the operations which are the means whereby the observations and measurements 
in question become possible has led many psychologists to mistake the means 
for the end. The point here to be eniphasked is that while observations must be 
©msidered in the context of the operations which make them possible, the central 
factor in the situation is what is observed. The moral of Bridgman’s treatise is 
that the intervening variable (X) is never directly observed but is an inference 
based on the ob^rvation of something else, and that the inference is criticaJly 
dependent upon the experimental manipulations (operations) which lead to the 
observations. An emphasis on operations which ignores the central impoi^nce 
of the dependent observations completely misses the virtue of what is coining to 
be known as operationism. 


The Subjective Versus the Objective in Behavior Theory 

The critical characteristic of the subjective as contrasted with the objective 
m timt the subjective tends to be a private event, whereas the objective is a public 
evmt, Le., an event presumed to be independently observable by many persons. 
Thus the perceptiial experience or conscious feeling of a person when stimulated 
by li^t rays of a certain wave length is said to be a private or subjective event, 
whereas the K^t rays themselves, or the overt behavior of another person in 
p^poni^ to tile inq^act of the light rays, is said to be a public or objective event. 

A typical ca^ of subjectivism in the field of theory, on the other hand, is one 
in which ti^ theorist asserts, and even believes, that he has deduced a 

prc^KKaticm m a logi^ manner, whereas in fact he has arrived at it by mere 
imtiiropoixKwpMc intuition. The subjectivism of behavior theory is thus de- 
upon a Mad of privacy, but one quite different from that of perceptual 
(xc caqper^ce. The subjective aspect of experience is dependent 
private xmture of the process hidden within the body of the sttbject; suh- 
in fi^i of behavior tiieory, on the other hand, is dependent uj)on 
ti» privahs mhxre oi the within tiie body of the the&ristf whereby he 

to expdam the behavior of the sibjeet. A theory becomes objective 
primary as^imptions and the lo^cal steps whereby these assumptions 
fead to lurtite prop^tior^ (theorems) are exhibited to public observation and 
80 iQ^e pc^sbte a Mxri of repetition of the logical proc^ by any other person. 
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Propositioiis origmating in private intuitions masquerading as unstated logical 
processes are, of course, not theoretical material at all, and have no proper place 
in science. 

ffistorical Note Concerning the Concept of Molar Behavior 
and of the Intervening Variable 

The important concept of molar, as contrasted with molecular, behavior was 
introduced into psychology in 1931 by E. C. Tohnan. The present writer has 
taken over the concept substantially as it appears in Tolman's well-known 
book (3). 

The explicit introduction into psychology of the equally important concept 
of the intervening variable is also due to Professor Tohnan; its first and best 
elaboration was given in his address as President of the American Psychological 
Association, delivered at Minneapolis, September 3, 1937 (5). 
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CHAPTER III 


Stimulus Reception and Organism Survival 

The intimate relation of the motility of organisms to survival 
has repeatedly been emphasized (p. 17). It has also been pointed 
out (p. 18) that if the action of organisms is to facilitate survival, 
movement must vary in an intimate manner not only with the state 
of the need but with the exact state of both the internal and the 
external environment at the instant of action occurrence. This 
means that the survival of the organism usually requires a precise 
integration of the animal’s motor organs with its environment. 
The means whereby the various stimulating energies of the environ- 
ment are mediated to the nervous system, the central integrating 
mechanism of organisms, must therefore be examined. 

SOME TEEMINOLOGICAL CLAEIEICATIONS 

Owing in part to the historical contamination of behavior 
science by metaphysical speculation, certain ambiguities and mis- 
understandings have arisen concerning the meanings of terms com- 
monly employed to indicate phenomena associated with receptor 
activity and functioning. We shall accordingly state our own use 
of these terms, employing the visual receptor for purposes of illus- 
tration. 

Light is believed by most physicists to be a wave phenomenon, 
the different wave lengths (or frequencies) being reflected from the 
surfaces of objects. Consider, for example, the large, red celluloid 
die represented in Figure 2, against a background of black velvet. 
Hie (he itself we shall call a stimulus object. From the surface of 
this stimulus object, wave lengths of approximately 650 milli- 
microns, say, are reflected in all directions except where opaque 
otetructions exist. From the white spots, waves of all lengths are 
reflected. The amount of the light reflected in a given direction 
wfll vary jointly with the angle of the surface involved and the 
direction of the light source. Any and all of the rays of light re- 
teited from the die are potential stimuli, depending upon whether 
<H- not a responsive receptor chances to be in such a position as to 
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receive them; in the latter case the light ray becomes an actiuil 
stimvlm (see Figure 2). 

Suppose, now, that the die were to be rotated slowly on its 
vertical axis as it stands in Figure 2. It is clear that the retina 



Fxg- 2. Diagrammatic representation of a stimulus’ object, a sheaf of 
potential stimuli, an actual stimulus (Si), and the sense organ of the observer 
upon which the actual stimulus impinge (5). 

would receive the impact of a gradually changing pattern or com- 
bination of light waves, each of the infinite number of angles re- 
sulting from the rotation reflecting a different configuration of 
stimulated points on the retina. Such a series of compound stimu- 
lations is called a stimulus continuum, 

SOME TYPICAL RECEPTORS AND THEIR ADAPTIVE FUNCTION'S 

The processes of organic evolution have solved the primary prob- 
lem of mediating to the organism the differential nature and state 
of the enwonment by developing specially differentiated organs 
each of which is normally and primarily stimulated only by a 
limited range of environmental energy. The normal individuals 
of the higher mammalian species possess receptors which respond 
to the stimulation of most forms of energy critically active during 
the period of evolution. Generally speaking, a distinct receptor 
organ is available for each energy type. In this way the several 
vitally important energy manifestations of the environment are 
qualitatively differentiated. We proceed now to a brief survey of 
the more important of these receptors, with special reference to the 
biolo^cal function performed by each. 

Biologically, one of the most primitive and necessary of the 
receptors is that responsive to contact or mechanical pressure. 
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The end organs are appropriately located in the skin or mem- 
branes susceptible to contact. An interesting extension of the con- 
tact receptor system beyond the surface of the organism is brought 
about by the fact that on the hairy parts the touch receptor is 
placed at the base of the hairs in such a way that when the shaft 
of hair is moved the receptor is activated. Since an object is likely 
to touch a hair before it reaches the skin, we have here a beginning 
of distance reception. This adaptive device is developed still 
further in certain organisms such as cats and rats in the long and 
relatively stiff hairs which extend outward from the face, mostly 
in the re^on of the nose. These extended or indirect touch recep- 
tors become very useful distance receptors in darkness, even though 
their range be small. 

One of the many necessities of the mammalian body is the 
maintenance of an optimum temperature. This means that recep- 
tors must be provided which will yield differential neural impulses 
when the environment is such as to cool the skin below, or to 
raise it above, a certain temperature. Instead of giving us a single 
thermometer to act on the thermostat principle, nature seems to 
have evolved two receptors, one for temperatures above about 
33.5 G. degrees, and another for temperatures below about 32.5 C. 
d^rees, depending somewhat upon circumstances. The receptor 
organs for temperature have not yet been determined with complete 
certainty (6, p. 1053 ff.). 

Perhaps the most imperative receptor need of the organism is 
to have organs responsive to a state of injury within the internal 
environment, i.e., to the destruction of tissue or to situations which 
if intensified or long continued would result in the destruction of 
tissue. The process of organic evolution has provided the higher 
oaganisms with an abundant supply of such end organs. They are 
caU^ nociceptors. The hollow organs of the viscera 
are provided with nociceptors which are activated by persistent 
distention. The organs responsive to tissue injury, if any, also 
imve not yet been determined with certainty (d, p. 1065 ff.). 

In ord^ tiiat the organism may behave differentially to good 
and bad fcxKi, it is clear that receptors yielding differential neural 
rt^ctions to mieh substances must be provided. Nature has solved 
problem by evolving chemical receptors for liquids — ^the gusta- 
tory r^ei^r oag^is, or taste buds, located mostly along the sides 
^d at the of the tongue. The location of these end organs 
m tite mouto m highly adaptive. They will not be activated by 
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sapid substances unless the latter are about to be ingested, yet the 
process of ingestion will not have gone so far at the time of end 
organ activation that rejection will not be easily possible (d, 
p. l(K>5ff.). 

The smell or olfactory receptors are activated by certain gases 
Quite appropriately the olfactory end organs are found in an upper 
chamber of the nose where they will be in contact with eddies from 
a constantly changing sample of the aerial environment of the 
organism, incidental to the breathing activities (d, p. 992). The 
olfactory receptor is in a favorable position for guarding the ali- 
mentary canal against undesirable substances and for guiding to 
suitable food substances. Since gases characteristic of a substance 
are apt to surround it for some distance, the receptor’s response to 
these may evoke rejection before the substance even enters the 
mouth (d, p. 992ff.). 

TBCB RECEPnOJr OP MOVEMENT 

An exceedingly important sense mediating one aspect of the 
internal environment is that known as proprioception, or kinaes- 
thesis. The receptors of this sense organ lie mainly in the muscles 
and joint capsules. Since they are stimulated by movements of 
either the muscles or the joints, these receptors become of special 
value in mediating all forms of activity involving a considerable 
degree of muscular coordination (d, p. 1072 £f.) . Ultimately we 
shall see reason to believe that this seemingly humble and obscure 
receptor plays an indispensable role in the most complex and subtle 
activities ever evolved by nature — ^those of symbolic behavior or 
thought. 

In the continued survey of movement reception we find that 
nature has evolved a remarkable organ which gives differential 
and characteristic receptor response to angular or progressive mo- 
tions of the head, whether the motion he active or passive. The 
ultimate receptors lie within the labyrinth (d, p. 204 ff.). This 
structure consists in the main of three small semi-circular canals 
placed at right angles to each other within the head and intimately 
related topologically to the internal ear. When the head is turned 
through an angle or moved progressively, the receptor organs within 
the labyrinth are stimulated in a distinctive manner, in this way 
initiating characteristic neural impulses. The particular receptor 
organs influenced by a given turning movement of the head will 
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depend largely upon the position in which the head is supported 
by the neck muscles. Thus adaptive reactions to angular or 
progressive movements of the body as a whole must be based on 
the combination or pattern (see p. 44 ff. and p. 349 if.) of the 
neural impulses arising in the labyrinth and those arising in the 
proprioceptive end organs in the neck and other parts of the body. 
It is accordingly not accidental that the nerve fibers extending to 
the brain from both the labyrinth and the muscles largely enter 
the same general portion of the brain, namely, the cerebellum. 

THE EECEPTIOH OF THE SPATIAL KELATIONSHIPS 

Even a cursory observation of organisms shows that their sur- 
vival depends upon their behavior being coordinated not only to 
the objects and substances in their environment, but to the dis- 
tances and directions of these from the organism and from each 
other. We have already noted an exceedingly crude distance re- 
ceptor in hairs, some of which are specialized for this function. 
Moreover, the proprioceptive and labyrinthine receptors are clearly 
connected with active and passive movement, and so are closely 
related to distance and direction. 

The olfactory receptor must also be mentioned in this connec- 
tion. Certain bodies, substances, and organisms give off gases, and 
these gases spontaneously diffuse through the atmosphere. In gen- 
eral the concentration of these gases will be greater, and so the 
intensity of receptor response will be the more intense, the closer 
the gas-emitting object is to the receptor. Intensity of olfactory 
rojeptor response accordingly becomes a basis for the mediation 
to the organism of a spatial relationship. The fact that gases are 
di^minated by means of air currents as well as by conduction has 
both advantage and disadvantages from the point of view of 
or^nism adaptation. If the air current reaches the receptor after 
p^smg near the redolent object, activation will occur, but if not, 
the gas will never reach the receptor and so the olfactory end organ 
will fail as a distance receptor. 

A much more dependable distance receptor has been developed 
incidental to the evolution of a receptor responsive to vibrations 
in tim air. Hie ultimate end organs of this receptor lie in the 
cc^Mea of the inner ear. The reason the sense of hearing is a 
gcKid distance receptor is that any vibrating object sets the neigh- 
bming ^ into vibration at the same rate, and this vibration is 
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propagated to adjacent air with gradually diminishing amplitude 
at the rate of a little over a thousand feet per second. This means 
that if an auditory receptor sensitive to a frequency of 100 vibra- 
tions per second begins sending neural impulses into the nervous 
system, an object vibrating at the rate of 100 per second must be 
somewhere in the external environment. 

The differential response of the auditory receptors to the par- 
ticular distance and direction of the vibrating object is mediated 
in a striking manner. Other things equal, the intensity of the 
\dbratory impact on the end organ varies inversely with the dis- 
tance. Unfortunately, this is a somewhat ambiguous criterion of 
distance, since the intensity of vibratory impact is also dependent 
upon the vibrational amplitude of the originating object. 

The matter of the receptor response to direction is more com- 
plex. This seems to be jointly dependent upon two factors. Since 
there is an ear at each side of the head, the ear which is turned 
toward the origin of an auditory vibration receives a somewhat 
more intense vibratory impact than does the other ear. A second 
factor, also dependent upon the double nature of the auditory re- 
ceptor, is that the phases of the air waves vary a little at the 
respective ears when one ear is turned toward the source as com- 
pared with the identity of the phase occurring when both ears are 
equally near the source. Moreover, the extent and nature of this 
dissimilarity in wave phase vary continuously as the head is 
rotated through 180 degrees from a position in which one ear is 
turned directly toTlrard the vibration source. 

The queen of the senses and the distance and direction receptor 
par excellence is the eye. The ultimate end organs of vision are 
microscopic rods and cones imbedded in the retina. The accessory 
mechanisms of the eye such as the iris, the lens, and the various 
humors are merely means for bringing the light energy reflected 
from the environment into adequate contact with the retina. 

Owing to the action of the lens system of the eye, limited por- 
tions of the external environment are projected with considerable 
fidelity upon the retina. In this way the spatial pattern of light 
frequencies reflected from environmental objects is presented di- 
rectly to the retina, giving precise and characteristic neural respon- 
siveness corresponding to each element of the pattern. This in- 
comparable sense organ, with its responsiveness to the different 
intensities of the various wave lengths of light, thus furnishes the 
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receptor basis for an almost nnlimited degree of differential adap- 
tive reaction to distinct stimulus objects and situations. 

Passing now to the matter of visual distance reception proper, 
we find that the physics of light furnishes a reliable foundation in 
the fact that the image of any given object as projected upon the 
retina varies inversely in si^e with the distance of the object from 
the eye. There is a slight ambiguity here in that the size of the 
image is also dependent upon the size of the object itself^ which may 
vary considerably. 

On the retina directly back of the pupil is a point of especially 
clear vision called the fovea. Other things equal, detailed sensi- 
tivity to visual patterns grows progressively less with distance from 
the fovea toward the periphery of the retina. Now it happens that 
the eye, unlike the ear, is a very mobile organ. It thus comes about 
that organisms readily learn to roll the eye in its socket in such 
a way that the image of an object of importance for adaptive re- 
action will fall on the fovea of each eye. This is called fixation. 
The movements of each eye in its socket are produced by the 
action of a set of six small external muscles. It is clear that the 
tension of the muscles which turn the eye in its socket, in conjunc- 
tion with the tension of the muscles of the neck, must yield a com- 
bination or pattern of proprioceptive neural impulses unambigu- 
ously correlated with the direction of the object fixated. 

Just as the doubleness of the hearing organ facilitates the re- 
ception of auditory distance cues, so the fact that we have twc 
ey^ aids in the reception of visual distance cues. Since the eyes 
are some inches apart, fixation on a single point near at hand 
produces a certain amount of convergence^ and the closer the ob- 
ject, the greater the convergence. This may easily be verified bj 
adking a friend to look at your finger as you move it forward and 
backward from six to eighteen inches before his face. As the eyes 
tern inward, the teimion of the muscles performing this actior 
vmos inv^^ly with the distance of the object fixated. The pro- 
pricx^tive receptors in certain of these muscles are accordingly 
stamulatei to an extent which varies inversely with the distance 
of &e object. In this roimdabout way, involving the joint actior 
of vision and proprioception, the differential sensory reception o' 
li^ance of objects within from 60 to 100 feet is accomplishec 
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THE MEDIATION OF TEMPORAL RELATIONSHIPS 

Careful observation of the conditions to which organisms must 
adapt themselves if they are to survive shows that in addition to 
the qualitative and spatial characteristics of the environment, the 
timing of behavior is frequently very important. The processes 
of organic evolution appear to have solved this problem in an even 
more roundabout and obscure manner than some of the receptor 
problems hitherto considered. In fact, the temporal characteris- 
tics of environmental events seem to be mediated without the 
action of any special receptor organ at all. 

It will be shown later (Figures 3 and 4) that the frequency 
of neural impulses emitted by a stimulated receptor undergoes a 
characteristic change during the continued action of the unchanged 
stimulating energy. There is also reason to believe that the effects 
of a stimulation which has ceased, persist for some time (p. 41), 
meanwhile undergoing progressive and consistent diminution. 
These changes in the neural responses to stimulation, almost purely 
as a function of time, are believed to furnish organisms with an 
adequate basis for timing their movements both during the con- 
tinuance of a critical stimulus and after its termination. 

THE PRIMARY PRINCIPLE OF STIMULATION 

The first step in the neural mediation of the state of both the 
internal and the external environment to the effectors of the or- 
ganism is dependent upon the principle of stimulation or excita- 
tion, The critical characteristic of this principle is that a small 
amount of energy acting on some specialized structure will release 
into activity potential energy from some other source, often in 
relatively large amounts. A familiar example is the trigger action 
of a gun. The projectile is impelled by the energy stored in the 
explosive charge; the pressure of the finger on the trigger merely 
serves to initiate the explosive action which, once started, is self- 
propagated. 

The principle of stimulation is operative at several points in 
the integrative apparatus of the mammalian organism, two of 
which we need to consider here. The first is Ihe action of mi 
energy source, such as light, upon a receptor organ such as the 
eye, which initiates a self-propagating impulse in an afferent nerve 
fibm:. The second is the action of an efferent neural impulse when 
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it impinges upon a muscle fiber. In this second case there results 
a release of energy stored in the cell which takes the form of 
longitudinal contraction (B). 

QXJAIj:TAa?IVB VERSUS QUANTITATIVE RECEPTOR ANALYSIS 
OP ENVIRONMENTAL ENERGIES 

Modem nemophysiological studies have shown that the neural 
discharges initiated by the stimulation of all receptors are substan- 
tially alike — a series of discrete waves (Figure 3). This seems to 



Fig. 3. Reproduction of a photographic record of the action potentials 
from a single optic-nerve fiber of a horseshoe crab when the end organ waa 
stimulated with different intensities of light. The gaps in the records represent 
the pa^ge of 2.8, 1.4, 4.5, and 3.3 seconds respectively. In record A, stimula- 
tion was .1 light unit; in B, .01; in C, ,001; and in £>, .0001. The duration of 
the action of the light on the end organ is indicated by the shadowy line just 
above the time line in each record. Note that the stronger the stimulation, 
the more rapid the neural impulses; and the longer the duration of the light, 
the slower the impulse emission. Note, also, that the light acts for an ap- 
preciable time before any neural impulses are emitted (latency) ; that this 
latency is Sorter, the more intense the light energy; and that usually there 
^ a few neural impulses emitted after the termination of the light. This is 
KXMmn as after-di^harge. (After Graham, 6, p. 830.) 

indicate that the only means whereby qualitative dilferentiation of 
mvmomnentel energies can take place in the nervous system must 
Me in the differentiation of the nerve fibers which transmit the 
Such an arrangement clearly provides a basis whereby 
■&e automatic switchboard activities of the central nervous system 
.may mute the impul^ initiated by qualitatively distinct stimuli 
Id^et forms of energy) to different muscles and muscle com- 
binations* 
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But if behavior is to be thoroughly adaptive, it must vary with 
the quantitative differences in environmental energies as well as 
with their qualitative differences. Observation indicates that 
higher organisms do, in fact, react differentially to varying intensi- 
ties of exactly the same forms of activating energy. If the radio 
gives forth too weak a tone we turn the volume knob in one direc- 
tion, and if it gives forth too strong a tone we turn the knob in 
the opposite direction. Delicate physiological investigations have 
revealed that the frequency of the neural waves or impulses emitted 
by a receptor is slow with weak stimulation and fast with strong 
stimulation. This principle is illustrated very nicely by a series of 
records published by Graham (6) and reproduced as Figure 3. The 
frequency of afferent neural impulses is accordingly the code re- 
ceived by the central nervous system which differentiates the 
various intensities of the same environmental energy. Somehow 
the central nervous system is evidently able to route nerve currents 
of different frequencies to the several muscle groups in much the 
same manner that it does neural impulses coming in over different 
fiber paths. 

CHARACTERISTICS OF THE AFFERENT NEURAL IMPULSE 
AND ITS PERSEVERATION 

It is clear that the immediate determinant of action in organ- 
isms is not the stimulating energy, but the neural impulse as finally 
routed to the muscles. A presumably critical neural determinant 
intermediate between these two extremes of stimulus (S) and re- 
sponse (E) is the afferent neural impulse ( 5 ) at about the time it 
enters the central ganglia of the nervous system. It is important 
to note that this afferent impulse (s) varies in certain ways not 
paralleled exactly by changes in the stimulating energy. A par- 
ticular form of this lack of parallelism is shown clearly in Adrian’s 
graph reproduced as Figure 4 and may be seen more concretely in 
Figure 3. In all receptors the frequency of receptor discharge be- 
gins at a low value and rapidly rises to a comparatively high 
maximum, after which the rate gradually falls, even though the 
stimulus continues to act without change. 

Certain molar behavioral observations render it extremely 
probable that the after-effects of receptor stimulation continue to 
rev^berate in the nervous system for a period measurable in 
seconds, and even minutes, after the termination of the action 
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types of receptors such, as those for light and sound, there is less 
opportunity for neural interaction than within a given receptor 
such as the retina. But still the richness of interconnections m 
the brain furnishes an ample basis for a certain amount of neural 
interaction here also. This presumption of a tendency for afferent 
neural impulses to interact before evoking organismic behavior has 
not yet received detailed physiological proof, and it accordingly 
has the status of an hypothesis. It will hereafter be called the 
hypothesis of afferent neural interaction, or, more briefly, the neural- 
interaction hypothesis. 

The neural-interaction hypothesis becomes of special importance 
when it is understood that organisms usually must behave in such 
a way as to surtdve in situations which, from the stimulus point of 
view, are exceedingly complex in the sense of involving the activa- 
tion of large numbers of receptors at the same time. Moreover, to 
be adaptive the behavior must sometimes occur only in the presence 
of a particular combination or configuration of afferent receptor 
impulses and not in response to any one of the component impulses; 
sometimes the situation is reversed — ^the reaction must occur in 
resi>onse to certain component impulses but not to the whole 
compound; and, finally, the response must sometimes be made to 
the componmt impulses irrespective of the occurrence or non- 
occurrence of other receptor impulses. It is obvious that such a 
seemingly inconsistent state of the environment presents an ex- 
tremely diflScult problem to the reacting organism, though not neces- 
sarily so to the theorist who would explain the degree of success in 
adaptation which various organisms actually manifest. It will be 
shown later (p. 349) that the hypothesis of afferent neural inter- 
action when combined wdth three other behavioral principles will 
enable us to understand how organisms are able to react to patterns 
or configurations of stimulation as such to the extent that this exists 
in fact. A notable situation of this kind is the response of or- 
ga m sans to distance as received through binocular vision (p. 38 ff.). 

THM ^OTAKEOTJS EMISSION OF IMIPULSES BY NERVE CELLS 

in this connection we must note another basic neural phenom- 
which ^ms to have rather far-reaching effects upon molar 
This is the apparent capacity of individual neurons 
^)onten^usly to g^erate neural impulses, as contrasted with the 
loog-necc^uzed capacity of nerve^ cells to continue the propagation 
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of a neural impulse transmitted to them from some other source. 
Many observations, such as that of the striking variability in or« 
ganismic behavior under seemingly constant external conditions, 
have suggested some such tendency. Paul Weiss has demonstrated 
the phenomenon by means of an extremely clever experiment: 

Fragments of spinal cord, including several segments, excised from 
larval salamanders . . . were grafted into the gelatinous connective tissue 
of the dorsal fin fold. ... In seven of the fourteen animals thus operated 
a limb was grafted at some distance anteriorly or posteriorly to the cord 
graft. . . . Histological study revealed . . . outgrowth of bundles of nerve 
fibers into the surroundings. The outgrowing nerve fibers form connec- 
tions with skin, trunk muscles, and in the presence of a grafted limb, also 
with the latter. . * . Within a few weeks of the transplantation these 
isolated cord limb complexes begin to exhibit functional activity, in which 
three successive phases can be roughly identified. . . . The first phase is 
characterized by intermittent or almost incessant twitching of the limb 
muscles. The twitches usually appear in spells, starting with irregular 
fibrillations and gradually building up. to violent convulsions. . . .At the 
peak of activity, the contractions are remarkably well synchronized, the 
limb executing strong periodic beats, sometimes at fairly regular intervals 
of the order of one to several seconds. ... As a crucial check against the 
possible intrusion of host innervation . . . the portion of the back con- 
taining the grafted units was completely excised and tested in isolation. 
Even so, the preparations exhibited the same fimctional activities as before. 
{9, p. 350-352.) 

It is at once evident that the spontaneous firing of nerve cells, 
if general throughout the nervous systems of normal adult organ- 
isms, must when taken in conjunction with the neural-interaction 
hypothesis imply an incessantly varying modification of both 
afferent and efferent impulses. From the point of view of the latter 
it is to be expected that this would produce both qualitative and 
quantitative variability of reaction to identical environmental re- 
ceptor stimulation, this variability presumably being a function of 
the normal “law^" of probability. This variability of response is 
called oscillation (p. 304 ff.) . 

SUMMARY 

For most animals, activity is necessary for survival. But not 
just any movement is sufficient; to be adaptive, movement must be 
coordinated in a precise manner with the state of various portions 
of the total environment. This coordination can occur only if the 
state of the environment is somehow continuously brought to bear 
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on the motor apparatus. Organic evolution has provided animals 
with special organs for receiving these physical environmental 
energies and converting them into neural impulses ; with a central 
mass of neural tissue which, acting as a kind of automatic switch- 
board, does a fair job of routing the receptor impulses in the direc- 
tion of the several muscles in adaptive amounts and proportions; 
and with efferent fibers for transmitting these routed impulses to 
the individual muscular elements, thus evoking the adaptive 
behavior. 

Receptor responses are prime examples of stimulation — ^the re- 
lease of relatively large amounts of resident energy by the action 
of small amounts of energy from an external source. Reception of 
touch, temperature, and pain is comparatively simple and requires 
no comment. Articulatory movement is received by special pro- 
prioceptive end organs. Progressive and angular movement is 
received by the vestibular receptor. Spatial relationships are ob- 
tained in various indirect ways, notably through the patterning of 
impulses received through the ears and eyes. Temporal relation- 
ships are also received in various indirect ways but chiefly through 
the progressive diminution both in the frequency of impulse emis- 
sion by the receptor during stimulation and in the strength of the 
“stimulus trace^^ following the termination of stimulation. . 

The reactions of organisms are ultimately evoked by the neural 
impulses which are relayed by the central nervous system to the 
muscles and glands. In a very real but indirect sense, however, 
reaction is evoked by the stimulus energies dependent on stimulus 
objects, though the neural impulse arising from the stimulation of 
a given receptor is by no means constant, even during the steady 
action of such a stimulus. The stimulus trace is a hypothetical 
perseverative process in the receptor areas of the brain which is 
believrf to follow the termination of a receptor discharge with 
gradu^y decreasing strength for some seconds and possibly 
minuto. 

Another factor preventing complete agreement or “constancy” 
l^wem the stimulation element and the afferent discharge to the 
bmin is Imlieved to be neural interaction — ^the mutual modification 
of all impul^ active in the nervous system at any given time, but 
^^^^y aff^ent impulses, more particularly those arising in the 
<XHn|Kmnd r^epte organ. This hypothesis makes possible 
of many important behavior phenomena, otherwise 
m the jwwer of organisms to react to patterns. 
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of stimulation as distinguished from the elements making up the 
pattern. Finally there must be added the presumptive phenom- 
enon of spontaneous periodic emission of neural impulses by all 
the cells of the nervous system. This, coupled with the interaction 
hypothesis, explains many obscure beha\dor phenomena, notably 
the variability or oscillation (4, p. 74) of behavior under what may 
be presumed to be relatively static environmental conditions. 

In the light of the preceding considerations, we formulate the 
following as primary molar behavior principles, postulates, or laws: 

POSTULATE 1 

When a stimtilus energy (S) impinges on a suitable receptor organ, an 
afferent neural impulse (s) is generated and is propagated along con- 
nected fibrous branches of nerve cells in the general direction of the 
effector organs, via the brain. During the continued action of the 
stimulus energy (S), this afferent impulse (s), after a short latency, rises 
quickly to a maximum of intensity, following which it gradually falls to 
a relatively low value as a simple decay function of the maximum. After 
the termination of the action of the stimulus energy (S) on the receptor, 
the afferent impulse (a) continues its activity in the central nervous 
tissue for some seconds, gradually diminishing to 2ero as a simple decay 
function of its value at the time the stimulus energy (S) ceases to act 

POSTULATE 2 

All afferent neural impulses (a) active in the nervous system at any 
given instant, interact with each other in such a way as to change each 
into something partially different (a) in a manner which varies with every 
concurrent associated afferent impulse or combination of such impulses. 
Other things eqtial, the magnitude of the interaction effect of one afferent 
impulse upon a second is an increasing monotonic function of the magni- 
tude of the first. 


NOTES 

Pavlov's Statement of the Principle of Afferent Neural Interaction 

While Pavlov did not make much systematic use of the prindple of afferent 
neural interaction, he stated it clearly and explicitly, as is shown by the f crflomng 
quotations assembled by Woodbury (10). The italics in all cases axe Woodbury^s, 
In connection with the phenomenon of conditioned inhibition, Pavlov remarked: 
‘When the additional stimulus or its fresh txace left in the hemispheres coincides 
with the acMon of the pc^tive stimulus, there must result some sort of special 
physiological ftision of ^ect of the sUmuLi into one armpownd excUaUon par&y 
differing from and partly resembling the positive oneJ* (7, p. 7D) 

Latex, when with what in the present work are called slirmjdus pcdtems 
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(p. 349 Pavlov stated: ^^The cases mentioned above show that a definite iriter- 
action takes 'place between different cells of the cortex, resulting in a fusion or syn- 
thesis of their physiological activities on simultaneous excitation. (7, p. 144) 
‘Tlainly the experiments reveal the great importance of the synthesizing 
activity of the cortical cells which are undergoing excitation. These cells must 
form, under the conditions of a ^ven experiment, a very complicated excitatory 
unit, which is functionally identical with the simple excitatory units existing in 
the case of more elementary conditioned reflexes. Such active cortical cells must 
necessarily influence one another and interact with one another, as has clearly been 
demonstrated in the case of compound simultaneous stimuli. The mutual inter- 
action between the excited or inhibited cortical elements in the case of compound 
successive stimuli is more complicated; the effect of an active cortical cell upon 
the one next excited varies according to the influence to which it was itself sub- 
jected by the cell last stimulated. In this way it is seen that the order in which a 
given group of stimuli taking part in a stimulatory compound are arranged, and 
the pauses between them are the factors which determine the final result of the 
stimulation, and therefore most probably the form of the reaction ” (7 
pp. 147-148) ••• 

While Pavlov clearly recognized the principle which we have called afferent 
neural interaction as weU as the importance of its r61e in the process of condi- 
tioning reactions to compound stimuli in distinctive combinations or patterns, 
it is noteworthy that he did not recognize how the conditioning of reactions to 
Stimulus patterns as such can be derived. In effect this means that he left the 
process of patterning as a primary principle. It can, however, be derived as a 
secondary phenomenon from four of his other principles (p. 349 ff.) which appear 
to be true primary molar laws: 

1. Afferent neural interaction (as indicated by the above quotations) 

2. Experimental extinction (p. 258 ff.) 

3. The generalkation of excitation effects (irradiation of excitation, p. 183) 

4. The generalization of extinction effects (irradiation of inhibition, p. 262) 


Afferent Neural Interaction and the Configuration Psychologies 

. is reason to believe that most of the Gestalt writers make extensive but 

^p cit use of a principle which is substantially equivalent in some respects at 
^ to the principle of afferent neural interaction. Kohler, however, has been 
m this rasp^t. In connection with a discussion of perceptual conscious 
remarks in a recent publication: “Our present knowledge of human 
p^pton tev^ no doubt as to the general form of any theory which is to do 
^owledge: a theory of perception must be a field theory. By this 
we t!mt the neural functions and processes with which the perceptual facts 
m each are located in a continuous medium; and that the 
that influence the events in other regions in a way 

is p 55) y on the properties of both in their relation to each other.” 

more e^rpHcitly: “To 
tiiose observations bear witness to an interaction 
be ote(^ wittun the phenomenal realm, they cannot he 
""Asatood m pmdy peyehologieal tenns. According to our general program 
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shall therefore assume that the interaction occurs among the brain correlates 
of the perceptual facts in question.” (5, p. 63) 

Continuing the elaboration of this same general point of view, Kohler adds: 
“If in a certain sense the correlate of a percept may be said to have a circum- 
scribed local existence we shall none the less postulate that as a dynamic agent 
it extends into the surrounding tissue, and that by this extension its presence is 
represented beyond its circumscribed locus. There is no contradiction in these 
statements. So far as certain properties of the percept process are concerned, 
this process may be confined within a restricted area, and with this nucleus the 
percept itself may be associated as an experience. At the same time the presence 
of such a percept nucleus may lead to further events in its environment, of which 
we are for the most part not directly aware; but this halo or field of the percept 
process may be responsible for any influence which the process exerts upon other 
percept processes.” (5, p. 66) 

It seems likely that in case an attempt were made to utilize Kohler^s inter- 
action principle in developing a thoroughgoing theory of the reaction of organisms 
to stimulus configurations, it would need to be supplemented by additional 
principles analogous to those required to supplement Pavlovas parallel principle, 
and for the same reasons. 
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CHAPTER IV 


The Biological Problem of Action and Its Coordination 

The receptors of an organism may respond with neural impulses 
in code corresponding to the near-by presence of food, of an enemy, 
or of a potential mate. But for the food to be seized and digested, 
the enemy to be escaped, or the process of reproduction to be 
initiated, the organism must do something; i.e., it must act. , Just 
as in the last chapter we surveyed the manner in which the processes 
of organic evolution have solved the receptor problem, so in the 
pr^nt one we shall consider how nature has evolved a solution 
to the problem of action. 

The effector activity of higher organisms is of two major kinds 
— secretional and motor. Generally speaking, the control of adap- 
tive secretion, such as that of saliva, seems to follow the molar laws 
of movement. Indeed, some of the most important molar laws 
of learning were originally isolated by Pavlov and his pupils 
(£, p. 19 ff.) through the study of conditioned salivary secretions. 
Because of their greater variety and general interest, discussion in 
the present chapter will be confined largely to the motor effectors. 

THE MOTOR ORGAN 

In contrast to the diversity of organs evolved for the reception 
of environmental energies, the equipment evolved for the execution 
of movanent is comparatively without variety. It consists of only 
one type of organ — the muscle.. There is plenty of variety in the 
behavior of or g a nisms , but the variety arises mostly from the 
location and attachments of the several muscles and the permuta- 
tKffls and combinations of their joint action rather than from their 
^lential structure. From the point of view of behavioral adapta- 
tkm, ihe^charaeteristic function of muscle is contraction. By “con- 
ba^on ’ is meant longitudinal shortening which, of course, neces- 
%nly means transverse thickening. 

nmscosmpic structure of muscle parallels to a considerable 
gross sfcructnre. The muscle cells are elongated bodies, 
Mativ# thick m the imddle and tapering at the end. These fibers 
are arbvated m tiieir contraction by neural impulses flowing in 

50 ^ 
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from the central nervous system along the fibrous branches of nerve 
cells. The actual junction of the nerve fibers with the individual 



Fig. 6. A section in profile of the motor end-plate of a striated muscle 
from a young mouse. The vertical striations at the bottom of the figure 
represent the muscular tissue. (From Fulton, after Boeke, p,. 198.) 


muscle cell is made by a specialized structure called the neural 
end’-plate. Typical coimections between nerve fibers and muscles 
are represented in Figure 6. 

TJBCE ALL~OE-]SrO]SrE LAW OF MUSCLE-ilBER ACnON 

Just as the intensity of stimulation needs to be transmitted to 
the organism by means of a graded neural code, so the rate and 
general vigor of muscular contraction need to be regulated and 
controlled in order that adaptation may be ^equate. Movement 
in some situations, such as the flight of a mouse when pursued by 
a cat, must be rapid and even violent in intensity, whereas in 
others, such as that of the cat in stalking the mouse, it must be 
slow and gentle. Nature has evolved a solution to the problem 
of the gradation of the intensity of aclion in the ^^all-or-none’^ 
mode of muscle-fiber response to neural discharge. 

Within recent years ingenious and delicate physiological inv^- 
tigations have succeeded in isolating small numbers of muscle fib^, 
in systematically varying the intensity of the neural impulses dis- 
charging into them, and in measuring the extent of the r^ulting 
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muscular action. The outcome of such an experiment is repro- 
duced as Figure 7. There it may be seen that as the intensity of 
the neural discharge gradually rises, the amplitude of the reaction 
of the muscle as a whole increases, not gradually but by a series 
of sharply marked steps; again, as the intensity of the stimulus 
gradually subsides, the step-wise behavior of the muscle is re- 
versed. Microscopic observation of the individual muscle cells 
under such experimental conditions confirms the hypothesis that the 
step-wise rise and fall in the magnitude of muscular activity of 



Fig. 7. Reproduction of a photographic record (above) of the movements 
of a muscle made up of a very small number of fibers. Below is a graphic 
record of the variation in the magnitude of electrical stimulation (break 
^ocks). jE^eciaUy note, at the left of the record, that the magnitude of the 
dicwk rises^ and falls gradually yet the muscular reaction rises and falls by a 
series of dis(arete steps. These steps are believed to be due to the entrance 
or c^sation of the action of discrete muscle fibers, which is the substance of 
the ail-or-none law, viz., that a particular muscle fiber either responds 
m ax ima l l y or not at aU, regardless of the amount of stimulation delivered. 
(From Fulton, J, p. 52.) 


Mgure 7 is due to the fact that at each step in the reaction record 
a new fiber of the muscle has become active or inactive respec- 
Mveiy, Meanwhile those fibers previously innervated by the weaker 
neural di^harge continue also to be innervated by all stronger 
d^hai^^ The eonelusion from this and much other evidence 
of a mmiiar nature is that each individual muscle fiber has a reac- 
fen tin:edM>ld of its own, below which it will not respond at all 
wMdi if will respond with its maximum contraction, 
tiie ^veral muscle fibers differ considerably in this 
threshold, Le., in the intensity of neural discharge required 



TJ]SrixEARNEI> COORDINATION OF MUSCULAR ACTIVITY 


For some organisms, muscles alone suffice for locomotion and 
other biological needs. Thus an earthworm is able to crawl about 
and to secure food; if stimulated by a touch it can withdraw into 
the safety of its burrow with quite remarkable suddenness. But 
for the more exacting survival requirements of higher organisms, 
muscles must be combined with relatively rigid levers. In lower 
forms of life such as insects, the lever system is on the outside of 
the body and serves also as a kind of protective armor. The levers 
of higher organisms consist of bony structures within the body, 
those levers primarily concerned with locomotion being generally 
rod-like in form. In order that the bony levers may be moved 
in various directions, muscles are usually attached to them in 
pairs called antagonists. Thus the contraction of the biceps on 
the top of the arm bends or flexes the arm at the elbow, whereas 
the contraction of the triceps on the opposite side of the arm re- 
verses the movement, straightening or extending the arm. By the 
combined action of various bones, joints, and muscular contrac- 
tions, terminal portions of the body such as the hand, say, may 
be moved in almost any conceivable direction or combination of 
directions. 

The joint action of two or more muscles in the adaptive move- 
ment of a portion of an animaFs body is called mmadar coordirm^ 
tion. Sometimes a dozen or more distinct muscle are involved in 
a single coordinated movement, such as the extension of the hind 
leg of the cat (Figure 8). This coordination (or integration, as it 
is sometimes called) is brought about in the main by the action 
of the nervous ^stem. Some of these coordinations are so simple 
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the survival of the organism in so far as it is dependent upon food. 
It is noteworthy that one mdispensable link in this chain— the 
flow of the milk— depends upon the very specialized action of the 
external environment, and once the milk is in the mouth the food 
itself constitutes the medium which joins the several links of the 
reflex chain. 


SUMMARY 

Generally speaking, higher organisms must be more or less 
active or perish. The effector organs of the typical higher organ- 
ism consist of glands and muscles. Movement is produced by the 
longitudinal contraction of the individual fibers making up the 
muscles. The several fibers of a muscle vary widely in the inten- 
sity of the neural impulse necessary to evoke action. For all inten- 
sities of neural impulse below the threshold the fiber will be wholly 
inactive, and for all intensities above the threshold it wiU be equally 
and maximally contractile. 

The infinitely complex results produced by the simple muscular 
contractions of organisms are brought about by the various com- 
binations of a relatively small number of muscles, all contracting 
various degrees and at various rates, several contractions jointly 
determining the position of a bodily part such as a finger or a foot. 

Certain adaptive situations are of such regularity that ready- 
made chains of reflex receptor-effector connections are adequate 
for survival, e.g., the blinking of the eyelid at any rough contact 
wiii the cornea. In many chains of reflex activity the action of 
the environment supplies indispensable links of the chain, as in the 
suckling of a young animal. 
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DRIVES ARE TYPICAL INTERVEMI^G VARIABLES 

It is important to note in this connection that the general con- 
cept of drive (D) ^ tends strongly to have the systematic status 
of an intervening variable or X (see Figure 1) , never directly ob- 
servable. The need of food, ordinarily called hunger, produces a 
typical primary drive. Like all satisfactory intervening variables, 
the presence and the amount of the himger drive are susceptible 
of a double determination on the basis of correlated events which 
are themselves directly observable. Specifically, the amount of 
the food need clearly increases with the number of hours elapsed 
since the last intake of food; here the amount of hxmger drive (jD) 
is a function of observable antecedent conditions, i.e., of the need 
which is measured by the number of hours of food privation. On 
the other hand, the amount of energy which will be expended by 
the organism in the securing of food varies largely with the inten- 
sity of the hunger drive existent at the time; here the amount of 
'^hunger” is a function of observable events which are its conse^ 
quence. As usual with unobservable, the determination of tiie 
exact quantitative functional relationship of the intervening vari- 
able to both the antecedent and the consequent conditions presents 

^In case the reader subsequently fails to recall the meaning of this, or 
any of the other signs employed in the present volume, the significance may 
be recovered in a moment by consulting the alphabetical list of signs and 
their meanings given in the Glossary of Symbols (p. 403 ff.). 
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serious practical difficulties. This probably explains the paradox 
that despite the almost universal use of the concepts of need and 
drive, this characteristic functional relationship is not yet deter- 
mined for any need, though some preliminary work has been done 
in an attempt to determine it for hunger (1), 

INNATE BEHAVIOR TENDENCIES VARY ABOUT A CENTRAL RANGE 

With our background of organic evolution we must believe that 
the behavior of newborn organisms is the result of unlearned, i.e., 
inherited, neural connections between receptors and effectors {sVr) 
which have been selected from fortuitous variations or mutations 
throughout the long history of the species. Since selection in this 
process has been on the intensely pragmatic basis of survival in 
a life-and-death struggle with multitudes of factors in a consider- 
able variety of environments, it is to be expected that the innate 
or reflex behavior of young organisms will, upon the whole, be rea- 
sonably well adapted to the modal stimulating situations in which 
it occurs. 

It may once have been supposed by some students of animal 
behavior, e.g., by Pavlov and other Russian reflexologists, that 
innate or reflex behavior is a rigid and unvarying neural con- 
nection between a single receptor discharge and the contraction of 
a particular muscle or muscle group. Whatever may have been the 
views held in the past, the facts of molar behavior, as well as the 
general dynamics of behavioral adaptation, now make it very 
clear not only that inherited behavior tendencies (sUr) are not 
strictly uniform and invariable, but that rigidly uniform reflex 
behavior would not be nearly so effective in terms of survival in 
A highly variable and impredictable environment as would a he- 
havior tendency. By this expression is meant 'behavior which will 
vary over a certain range, the frequency of occurrence at that seg- 
m®t of the range most likely to be adaptive being greatest, and 
the frequency at those segments of the range least likely to be 
adaptive being, upon the whole, correspondingly rare. Thus in the 
expE^^on sUr, R represents not a single act but a considerable 
of more or 1^ alternative reaction potentialities. 

The i^urophysiological mechanism whereby the type of flexible 
r^ptor-eff^OT dynamic relationship could operate is by no means 
w^ly but a number of factors predisposing to variability 
of mre evident. First must be nientioned the spontaneous 



impulse discharge of individual nerve cells, discussed above (p. 
44). This, in conjunction with the principle of neural interaction 
operating on efferent neural impulses {efferent neural interaction) y, 
would produce a certain amount of variability in any reaction. 
Secondly, the variable proprioceptive stimulation arising from the 
already varying reaction would, by afferent neural interaction, 
clearly increase the range of variability in the reaction. Finally, 
as the primary exciting (drive) stimulus increases in intensity, it 
is to be expected that the effector impulses will rise above the 
thresholds of wider and wider ranges of effectors until practically 
the entire effector system may be activated. 

Consider the situation resulting from a foreign object entering 
the eye. If the object is very small the stimulation of its presence 
may result in little more than a slightly increased frequency of 
lid closure and a small increase in lachrymal secretion, two effector 
processes presenting no very conspicuous range of variability except 
quantitatively. But if the object be relatively large and rough, 
and if the stimulation continues after the first vigorous blinks and 
tear secretions have occurred, the muscles of the arm will move 
the hand to the point of stimulation and a considerable variety of 
manipulative movements will follow, all more or less likely to 
contribute to the removal of the acutely stimulating object but 
none of them yrecisely adapted to that end. 

In the case of a healthy human infant, which is hungry or is 
being pricked by a pin, we have the same general picture, though 
the details naturally will differ to a certain extent. If the need be 
acute, the child will scream loudly, opening its mouth very wide 
and closing its eyes; both legs will kick vigorously in rhythmic 
alternation, and the arms will flail about in a variety of motions 
which have, however, a general focus at the mouth and eyes. In 
cases of severe and somewhat protracted injurious stimulation the 
back may be arched and practically the entire musculature of the 
organism may be thrown into more or less violent activity. 
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to micturate, the need for rest (after protracted exertion), the need 
for sleep (after protracted wakefulness) , and the need for activity 
(after protracted inaction) . The drives concerned with the main- 
tenance of the species are those which lead to sexual intercourse 
and the need represented by nest building and care of the young. 
The primary core or mode of the range of innate or reflex ten- 
dencies to action must naturally vary from one need to another if 
the behavior is to be adaptive. In cases where the role of chance 
as to what movements will be adaptive is relatively small, the 
behavior tendency may be relatively simple and constant. For 
example, the acute need for oxygen may normally be satisfied (ter- 
minated) by inspiration; the need represented by pressure in the 
urinary bladder is normally terminated by micturition. It is not 
accidental that these relatively stereotyped and invariable reactions 
are apt to concern mainly those portions of the external environ- 
ment which are highly constant and, especially, the internal en- 
vironment which is characteristically constant and predictable. 

In the case of mechanical tissue injury, withdrawal of the in- 
jured part from the point where the injury began is the character- 
istic reflex form of behavior, and the probability of the effectiveness 
of such action is obvious. Environmental temperatures consider- 
ably below the optimum for the organism tend to evoke shivering 
and a posture presenting a minimum of surface exposed to heat loss. 
Temperatures above the optimum tend to produce a general inac- 
tivity, a posture yielding a maximum surface for heat radiation, 
and rapid panting. In certain relatively complex situations such 
as those associated with the need for food, water, or reproduction, 
toe factor of search is apt to be included as a preliminary. Since 
extensive search involves locomotion, the preliminary activities 
arisang from to^e three needs will naturally be much alike. 

OBGAHIC €X3NT>mOKS WHICH INITIATE THREE TYPICAL 
PRIMARY URIVE BEHAVIORS 

Thmug rajent years physiologists and students of behavior have 
important advances in unraveling the more immediate con- 
which are associated with the onset of the activities char- 
of ^ tor^most complex primary drives — thirst, himger, 
Thi^ aetiviti^ appear from these studies to be initiated 
% a toe mouto and throat caused by the lack of saliva, 

wteh M ite tern by toe lack of available water in toe 
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the stomach, terminating in a kind of cramp followed by a period 
of cessation. Presently the stomach contractions begin again, and 
are more or less continuous throughout the remainder of the record. 
But the main point of this is that the restless movements of the 
sleeping student (recorded as short vertical oscillations of the 
middle line in Figure 9) occurred as a rule only when the stomach ' 
contractions were occurring, especially when they were at a maxi- 
mum. 

Richter {2) attempted to secure parallel records of the stomach 
contractions and the restless locomotor activity of rats and other 
organisms to complete the proof of the presumptive relationship 
afforded by Wada’s findings, but was unsuccessful, apparently be- 
cause of technical diflSculties encountered. However, he was able 
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Pig. 10- Diagrammatic representation of the inferred relationship between 
the periodic stomach contractions of rats and their restless locomotor activity 
m the living cage. (After Richter, 2 , p. 312.) 


to show that rats are periodically rather restless in the living cage 
for a short time before going into a food chamber to eat, after 
which g^eral activity quickly subsides. The periodicity of these 
resU^ movements is about the same as that known to occur with 
tile ^mach contractions. Richter accordingly concludes from a 
ecmvincing array of such indirect evidence that the relationship 
between random, r^less activity and the gastric hunger contrac- 
tions m substantially that shown in Figure 10. It is to be observed 
tiiat this figure is not a record but, rather, a diagrammatic repre- 
of an inferential relationship. Nevertheless, Richter^s 
is fairly convincing and Figure 10 quite probably repre- 
3^% time sitiiation. 

fiBMrtaimal interpretation of this restless behavior is that 
«B nrgMn^ which moves about more or less continuously will in 
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general traverse a wide area and consequently will be more likely 
to encounter food than if it remains quietly in one place. 

TYPICAL STUDIES OF SEXUALLY MOTIVATED ACTIVITY 

Richter has also shown that the male rat displays much more 
restless locomotor activity when the sex drive is operating than 
when it is not. He placed male rats in drum-like cages pivoted 
on a central axis in such a way that if the animal attempted to 
climb the circular side of the cage its weight would turn the drum. 
Automatic counting devices aggregated the amount of this kind of 



Feg. 11. Graphic representation of restless locomotor activity of a male 
albino rat in a revolving cage before and after castration. (After Richter, 
t, 329.) 


locomotor activity by days. A graphic representation of the nu- 
merical values so obtained is shown at the left of Figure 11. At 
about the 195th day the rat was castrated. Note the abrupt drop 
in the restless locomotor activity. Even if one makes a certain 
allowance for the shock produced by the operation as such, the 
inference is that when the hormone secreted by the testes is in the 
blood, the animal is generally active, but when this hormone is 
withdrawn through castration, generalized locomotor activity falls 
to a relatively low level and remains there. 

Wang ( 4 ) has shown by analogous means that the female rat 
is maximally active in this same r^less fashion about every fourth 
day, the two or three days between showing a relatively small 
amount of activity. A considerable number of cycl^ from a 
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female rat are represented graphically in Figure 12 ( 2 ). That 
these maxima of locomotor activity are coincident with periodically 
recurring sex drive is shown by the fact that on the occasion of 
the maxima such animals are receptive to the sexual advances of 
the male. 

The functional interpretation of these studies is similar to that 



^ niv^gataons involving hunger; an orgardsm which moves 
will traverse a wide area and consequently 
, . , l&ely to encounter a mate than will an organism 

whieh rmnmns m a single place. 


SVMMAET 


nf *1. - - as aggregations of needs. The 

<a tte Sector apparatus is to mediate the satiation of 
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these needs. They arise through progressive changes within the 
organism or through the injurious impact of the external environ- 
ment. The function of one group of receptors (the drive recep- 
tors) is to transmit to the motor apparatus, via the brain, activat- 
ing impulses corresponding to the nature and intensity of the need 
as it arises. Probably through the action of these drive receptors 
and receptor-effector connections preestablished by the processes 
of organic evolution, the various needs evoke actions which increase 
in intensity and variety as the need becomes more acute. 

Because of the inherently f ortuitous nature of the environmental 
circumstances surrounding an organism when a state of need arises, 
the kind of behavior which will be required to alleviate the need 
is apt to be highly varied. For this reason rigid receptor-effector 
connections could not be very effective in terms of organismic sur- 
vival. Accordingly we find, as a matter of fact, that innate molar 
response to a given state of need presents a considerable range of 
activity, the activity often consisting of a sequence of short cycles 
of somewhat similar yet more or less varied movements. Such 
behavior cycles are believed normally to show a frequency dis- 
tribution in which those acts most likely to relieve the need occur 
most often, and those acts less likely to terminate it occur corre- 
spondingly less often. Thus reflex organization (sUb) has more 
than one string to its bow; if one reaction cycle does not terminate 
the need, another may. The modal form of reaction will also be 
strongest, so that it will usually occur not only most frequently, but 
earliest and probably will remedy the situation; but if the environ- 
ment chances to be such that some other simple action sequence is 
required, in due course this action sequence will probably occur 
and the organism will survive. Finally, in still more complicated 
situations a particular combination of these acts may terminate the 
need. In this way innate behavior tendencies are organized on a 
genuine but primitive trial-and-error basis. 

Lastly, it is to be noted that just as food-seeking activity 
begins long before the organism is in acute need of food, so other 
drives become active long before heat or cold or pain becomes seri- 
ously injurious or even in the least harmfuL In short, it may be 
said that drives become active in situations which, if more intense 
or if prolonged, would become injurious. Once more, then, the 
probability aspect of primitive behavior tendencies become mam- 
fest. 
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In the light of the preceding considerations we formulate the 
following primary molar behavior principle: 

POSTULATE 3 

Organisms at birth possess receptor effector connections (sUr) which, 
under combined stimulation (S) and drive (Z>), have the potentiality of 
evoking a hierarchy of responses that either individually or in combi- 
nation are more likely to terminate the need than would be a random 
selection from the reaction potentials resulting from other stimulus and 
drive combinations. 

NOTES 

The Role of Adaptation in Systematic Behavior Theory 

Ihe emphasis in this and preceding chapters on the general significance of 
organic evolution in adapting organisms to meet critical biological emergencies 
calls for a word of comment, lest the reader be misled in regard to the r61e that 
adaptation, as such, plays in the present system. It is the view of the author 
that adaptive considerations are useful in making a preliminary survey in the 
search for postulates, but that once the postulates have been selected they must 
stand on their own feet. This means that once chosen, postulates or principles 
of behavior must be able to yield deductions in agreement with observed detailed 
phenomena of behavior; and, failing this, that' no amount of a priori general 
Adaptive plauability wiU save such a postulate from being abandoned. 

"Problems Associated with the Use of Drive (D) as an Intervening Variable 

Most writers on behavior theory utilize the concept of need or some equivalent 
such as drive, though hardly one of them has faced squarely the associated problem 
of fi n di ng the two equations necessarily involved if the concept of drive is to take 
its place in a strict mathematical theory of behavior (S), In the case of hunger, 
f<a' example, there must be an equation expressing the degree of drive or motiva- 
ikm as a function of the number of hours^ food privation, say, and there must 
be a second equation e^ressing the vigor of organismic action as a function of 
degree of drive (D) or motivation, combined] in some manner with habit 
A correlated task of some magnitude is that of objectively defining a 
unit in win<^ to e3qH:ess the degree of such a motivational intervening variable 
Cseepwmff.). 

Mow it is a relativdy easy matter to find a single empirical equation expressing 
v%cMr cf reaction as a function of the number of hours^ food privation or the 
atrengtii ^ an electnc shock, but it is an exceedingly difficult task to break such 
^ up into the two really meaningful component equations involving 

diive (D) or motivation as an intervening variable. It may confidently 
b^pe&ted that Aany writers with a positivistic or anti-theoretical inclination 
re^^ a pro<^ure as both futile and unsound. From the point of view 
such a poeedure, if successful, would present an immense 
s^tement is made on the assumption that motivation (D) as 
whe^^ its migp be food privation, electric shock, or whatever, bears a 
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certain constant relationship to action intensity in combination with other factors, 
such as habit strength. If this fundamental relationship could be determined 
once and for all, the necessity for its determination for each special drive could not 
then exist, and so much useless labor would be avoided. Unfortunately it may 
turn out that what we now call drive and motivation will prove to be so hetero- 
geneous that no single equation can represent the motivational potentiality of any 
two needs. Whether or not this is the case can be determined only by triaL 
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CHAPTER VI 


The Acquisition of Receptor-Effector Connections — 
Primary Reinforcement 

We have seen above that organisms require a considerable 
variety of optimal conditions if the individual and the species are 
to survive. In many cases where the conditions, particularly in- 
ternal ones, deviate materially from the optimum, complex auto- 
matic physiological processes make the adjustment. An example 
of this is the remarkable manner in which the blood is maintained 
at a practically constant state in the face of a great variety of 
adverse conditions. This type of automaticity has been called by 
Cannon the “wisdom of the body'^ (S), In the case of certain 
other needs, and here lies our chief interest, the situation is remedi- 
able only by movement, i.e., muscular activity, on the part of the 
organism concerned. The processes of organic evolution have pro- 
duced a form of nervous system in the higher organisms which, 
under the conditions of the several needs of this type, will evoke 
without previous learning a considerable variety of movements 
each of which has a certain probability of terminating the need. 
This kind of activity we call behavior. 

THE PEOBLEM AKD GENERAL NATURE OP LEARNING 

It is evident, however, that such an arrangement of ready-made 
{inherited) receptor-effector tendencies, even when those evoked 
by each state of need are distinctly varied, will hardly be optimally 
effective for the survival .of organisms living in a complex, highly 
variable, and consequently unpredictable environment. For the 
optimal probability of survival of such organisms, inherited be- 
havior tendencies must be supplemented by learning. That learning 
dc^ in fact greaiJy improve the adaptive quality of the behavior 
of higter organims is attested by the most casual observation. 
But the detailed nature of the learning process is not revealed by 
casual ol^rvation; this becomes evident only through the study 
of many carefully deigned and executed experiments. 

Ihe e^^tial nature of the learning process may, however, be 
quite simply. Just as the inherited equipment of reaction 
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tendencies consists of receptor-effector connections, so the process 
of learning consists in the strengthening of certain of these con- 
nections as contrasted with others, or in the setting np of quite 
new connections. In many ways this is the highest and most sig- 
nificant phenomenon produced by the processes of organic evolu- 
tion. It will be our fascinating task in the present and several 
succeeding chapters to tease out bit by bit from the results of very 
many experiments the more important molar laws or rules accord- 
ing to which this supremely important biological process takes 
place. 

In accordance with the objective approach outlined in Chapter 
II we must regard the processes of learning as wholly automatic. 
By this it is meant that the learning must result from the mere 
interaction between the organism, including its equipment of action 
tendencies at the moment, and its environment, internal as well 
as external. Moreover, the molar laws or rules according to which 
this interaction results in the formation or strengthening of recep- 
tor-effector connections must be capable of clear and explicit state- 
liient. Recourse cannot be had to any monitor, entelechy, mind, 
or spirit hidden within the organism who will tell the nervous 
system which receptor-effector connection to strengthen or which 
receptor-effector combination to connect de novo. Such a pro- 
cedure, however it may be disguised, merely raises the question of 
the rule according to which the entelechy or spirit itself operates; 
this, of course, is the original question all over again and clarifies 
nothing. 

THE STREDS'GTHElSriNG OF IKHATE RECEPTC3-EFFBCT0R 

corfHEcnoNS 

Because' of its presumptiye temporal priority in the life of the 
organism, we shall consider first the problem of the selective 
strengthening of one among a variety of inherited movement ten- 
dencies evoked by a need in a particular environing situation. This 
can perhaps best be done by means of an illustrative experiment^ 
even though some of the reaction tendencies there operative may 
already have been modified by learning. The ^qpeiimeatal pro- 
cedure and the results will be described in a little detail, out of 
consideration for readers who have slight knowledge of the routine 
methodologies characteristic of behavior laboratories. 
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r Demonstration Experiment A. The laboratory in which the experiment- 
is performed is without windows and its walls are painted black; this gives 
the room an appearance of being rather dimly illuminated, though in fact 
it is not. On a table rests a black wooden apparatus about two feet long, 
a foot wide, and a foot high. It has a hinged glass lid which permits clear 
observation of the interior. The floor of the box consists of smah trans* 
verse rods of stainless steel placed about a quarter inch apart. Midway 
between the two ends of the box is a partition consisting of the same 
type of metal rods similarly arranged but placed vertically. This partition 
or barrier reaches to within about four inches of the lid. A two-throw 
electric switch permits the charging of the floor rods of either compart- 
ment and of the partition with a weak alternating current. 

On a second table nearby there rests a wire cage containing a sleek 
and lively albino rat about one hundred days of age. The laboratory 
technician opens the lid of the cage and the rat at once stands up on 
its hind legs with its head and forepaws outside the aperture. The tech- 
nician grasps the rat about the middle with his bare hand and transfers 
it to one of the compartments of the apparatus. The animal, after a 
brief pause, begins mo\Tng about the compartment, smflBng and inspecting 
the various parts, often stretching up on its hind legs to its full length 
against the walls of the box. 

After some minutes the technician throws the switch which charges 
both the partition and the grid upon which the rat is standing. The 
animal^s behavior changes at once; in place of the deliberate exploratory 
movements it now displays an exaggeratedly mincing mode of locomotion 
about the compartm^t interspersed with occasional slight squeaks, biting 
of the bam which are shocking its feet, defecation, urination, and leaps up 
the walls. These reactions are repeated in various orders and in various 
parts of the compartment; sometimes the same act occurs several times in 
sucee^on, ^metimes not. After five or six minutes of this variable be- 
havior one of the leaps carries the animal over the barrier upon the 
uncharged grid of the second compartment. Here after an interval of 
qui^cence and heavy breathing the animal cautiously resumes exploratory 
behavior, much as in the first compartment. Ten minutes after the first 
top of the barriei ::he second grid is charged and the animal goes through 
mitetantially the same type of variable behavior as before. This finally 
ra^ts in a second loping of the barrier and ten minutes more of safety, 
after which this grid is again chaiged, and so on. In this way the animal 
m given fiftmi trials, eadi terminated by a leap over the barrier. 

A comparison of the animaFs behavior leading to his successive 
from the charged grid shows clear evidence of learning in 
tiiat upon the whole the time from the onset of the shock to the 
^ape b^^me jux^mively less, until at the last few trials the 
l^pin^ followed the on^t of the shock almost instantane- 

ously. MemawMle the competing reactions gradually decreased in 
until at tl^ they ceased to occur altogether. Once or 
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twice the rat even leaped the barrier before the shock was turned 
on at all. Here, then, we have a clear case of selective learning. 

It is evident from the foregoing that the final successful com- 
petition of the reaction of leaping the barrier (R^) with the various 
futile reactions of the series such as leaping against the wooden 
walls of the apparatus (72^), squeaking and biting the floor 
bars (Bs) must have resulted, in part at least, from a differential 
strengthening of R^. It is also evident that each of these com- 
peting reactions was originally evoked by the slightly injurious 
effects of the current on the animaPs feet (the condition of need 
or drive, D) in conjunction wdth the stimulation (visual, cutaneous, 
etc.) arising from the apparatus at about the time that the reaction 
took place. The stimulation arising from the apparatus at the 
time of the respective reactions needs to be designated specifically: 
leaping against the wall will be represented by Sa; squeaking, by 
Sa'I biting, by Sa"; and leaping the barrier, by Sa"^- It is assumed 
that preceding the learning, the leaping of the barrier was evoked 
by a compound connection between the receptor discharges Sb and 
Sa, arising from Sd and Sa respectively, and Rf, i.e., R^ must have 

been evoked jointly by the converging connections, Sb >Rj^ and 

Sa'" ^ R^> These are the connections which evidently have been 

strengthened or reinforced. Because of this, learning is said to be 
a process of reinforcement 

We must now approach the central problem of learning by at- 
tempting to formulate the rule according to which primary rein- 
forcement occurred in this case of selective learning. More specifi- 
cally, we must ask the rule according to which the connections 

^B > R^ and Sa'" > R 4 were differentially strengthened so as 

to become dominant over the numerous other reaction tendencies. 
The most plausible statement of this rule at present available is: 
Whenever a reaction (jK) takes place in temporal contiguity with 
an afferent receptor impulse (s) resulting from the impact upon a 
receptor of a stimulus energy (S), and this conjunction is followed 
closely by the diminution in a need {and the associated difninution 
in the drive, D, and in the drive receptor discharge, Sb) 9 there will 

result an increment, A (s > R ) , in the tendency for that stimr- 

ulus on subsequent occasions to evoke that reaction. This is the 
of primary reinforcement,^ 

Thus in the case of learning exhibited in Demonstration Ex- 

^ Actually, of course, this formulation has only the status of an hypoth^^ 
Tbe term law is here used in much the same loose way l^iat Thcmidiice 
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periment A, both of the afferent stimulus impulses, Sd and Sj,'", 
were obviously active when reaction occurred because they 
evoked it. Moreover, this conjunction of Sd and Sa"’ with R^ was 
followed immediately by the termination of the shock effects or 
need, and so by a reduction in sd- But by the principle of primary 
reinforcement just formulated this reduction in the need and drive 
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Fig. 13. Diagrammatic representation of the process of strengthening or 
reinforcing the connections between and The step-like rises 

and falls of the several horizontal lines such as those of D and Sd, the shock 
to the tissue and represent the rise from zero and the fail of the respective 
process. The arrows with wavy shafts — >) represent a physical causal 
relationship other than by way of receptor-effector stimulus evocation. Thus 
the rise of the current on the grid (Sd) causes the shock to the tissue of the 
animaJ^s feet, the response of the receptor in the skin ($d) of those regions, 
and the drive (D) or motivation to action. The separation of the foot from 
the gnd by the act of jumping iR*) terminates simultaneously the injurious 
acMcm or need, the receptor discharge (to), and the drive (D), though the 
current on the gnd remains unchanged. It is the reduction in the drive 
receptor impulse (to) and the drive (D) which are believed to be the critical 

factors in the process of reinforcement. The arrows with solid shafts ( >), 

whethCT curved or strai^t, separate or jointed, represent receptor-effector 
r^stioni^i|p6 m esi^^ce before the learning process here represented occurred. 

The ^Tows with brokmi shafts ( >) represent receptor-effector connections 

h^ in pro^ of formation. Distance from left to right represents the 
pas8i®& oi time. 


(D) will, tibirough the associated decrement in the drive receptor im- 
pute C^n), r^t in increments, >B^) and A (sjo > 

i?i), to tile teideney for such conjoined afferent stimulus elements 
and %) to evoke the reaction (B^) on subsequent occasions. 
*I1ie major dynamie factors of tins process are represented diagram- 
in Figure 13. 


^ ft m toot® &xpTes^, “law of effect,” to which the above formu- 
la IS rektM {9, p. 176). 
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the ACQXJISITIOlSr OP JSTEW EECEPTOR-EFPECTOR CONNECTIONS 

We proceed now to the consideration of the formation of a 
genuinely new receptor-effector connection. This turns out to be 
only a special case of the law of primary reinforcement which we 
have just formulated. Moreover, this type of selective learning 
may be demonstrated by means of an experiment differing only 
slightly from the one already considered at some length. 

Demonstration Experiment B. The variation of Demonstration Ex- 
periment A consists merely in the sounding of a buzzer continuously 
from a time two. seconds before the shock is turned on the grid until the 
fl-TiiTYial leaps the barrier. The course of the learrdng is much as in the 
preceding experiment up to the point where the animal has eliminated 
ail of the original acts except that of leaping the barrier. At this point, 
however, the animal begins occasionally to leap over the barrier during 
the first two seconds of the sounding of the buzzer. 

It will be evident at once that this outcome follows directly 
from the law of primary reinforcement stated above because, as 


BUZZER Sp-~ J 

INTERIOR OF APPARATUS (A) 


DRIVE AND DRIVE- RECEPTOR 
IMPULSE FROM SHOCK 


INJURY TO TISSUE OF FEET (NEED) - 


’A ''A -A ''A^ \ 


ELECTRIC CHAR<X ON GRID (SjP * 

Pig. 14. Diagrammatic representation of the dynamic factors involved in 
the setting up of a new receptor-effector connection in a selective learning 
situation. Demonstration Experiment B. With the exception of the upper 
line representing the rise and continuation of the buzzer stimulus and^ its 
receptor discharge, and the connection sb — R* in process of formation, 
this diagram is exactly the same as that of Kgure iS. 

shown by Figure 14 , the receptor d^charge (sjj) resulting firom the 
action of the buzzer vibrations (&) on the mechanisms of the 
internal ear has a conjimction (just as have Sd and with 
and the termination of the need, and that of the associated drive 
(D) and drive receptor discharge (sd), constitute a reinforcing 
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state of affairs exactly as in the first form of the experiment. Thus 
after reinforcement in the manner described there exists in the body 
of the organism a habit represented diagrammatically by the 

broken-shafted arrows of Figures 14 and 
15. 

From other experiments (p. 209) it is 
known that the receptor-effector connec- 
tions from several stimulus aggregates, all 
converging simultaneously upon the same 
reaction, tend strongly to summate (see 
p. 223 ff.). As a result the action of all 
three connections would be stronger than 
that of the entire group less any one. Be- 
cause of the abs^ence of Sd > Sn 

during the uncharged state of the grid, it 
is to be expected that the rat would at 
first wait until the delivery of the shock 
before jumping. However, with addi- 
tional reinforcements the action-evocation 
strengths of Sb and Sa'" finally become 
great enough when combined to evoke the 
reaction before the shock is delivered. 
Because this reaction is the same as that 
which usually occurs only after the onset 
of the shock, it is called antedating or 
anticipatory; since it results in total escape from the injury pro- 
duced by the shock, it is highly adaptive. 

Finally, with still more reinforcements, the receptor-effector 
connections become so strong that the relatively static connection, 

>sa'" alone will evoke the jumping reaction, i.e., 

the stimulus of the apparatus alone will evoke the adaptive re- 
s|K)nse, before the onset of either the buzzer or the shock. 





Fig. 15. Diagram repre- 
senting the results of a re- 
inforcement in which a 
quite new receptor-effector 
connection (sb — E*) 
has been set up as shown 
in Figure 14. The arrows 
with double shafts repre- 
sent the combination of 
the old and the newly ac- 
quired receptor - effector 
connections between 
and SDf respectively, and 
lU. The imbroken shafts 
represent either unlearned 
receptor-effector connec- 
tions or at least connec- 
tions in existence before 
the particular learning un- 
der consideration began. 


THE COOT)ITIONED REFLEX 

A special case of the action of the principle of reinforcement 
sfceteh^ alxive is found in the type of experiment in which there 
is up what is indifferently called the conditioned reflex or the 
reaction, D^pite a certain artificiality, the relative 
ffisapEcily of this type of experiment has permitted the isolation 
of m laige number of molar laws, particularly in the laboratory of 
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I. P. Pavlov, the great Russian physiologist (5). The typical 
conditioned-reflex experiment is conducted in such a way as (1) to 
set up new receptor-effector connections rather than, as in Demon- 
stration Experiment A, to strengthen connections already strong 
enough in combination to evoke overt reactions, and (2) to elimi- 
nate the necessity of selecting one reaction from the numerous 
varied reactions normally evoked by the conjunction of a need in 
a stimulating situation. The elimination of the complication char- 
acteristic of selective learning is brought about by the simple ex- 
pedient of reinforcing the first, i.e., the dominant, reaction of the 
potential sequence of competing tendencies normally evoked by a 
need situation. Since the process of primary reinforcement em- 
ployed in such experiments usually involves the termination of the 
need, the duration of the need is so brief that the weaker members 
of the potential action group rarely become overt. 

In order to make the distinctive characteristics of the condi- 
tioned-reflex experiment especially clear, the following example of 
conditioned-reflex learning was designed in such a way as to be 
an exact parallel to Demonstration Experiment- A. 

Demonstration Experiment C. A dog is habituated to stand in a stock 
or wooden framework resting on a laboratory table. A soft leather 
moccasin containing an exposed electric grid in its sole is laced to the dog’s 
foot. The moccasin is attached to a light hinged board which is held 
down by a coil spring. The electric circuit which conducts the alternating 
current to the grid passes across a connection which is broken when the 
dog’s footboard is lifted one inch at the point of moccasin attachment. 

The experiment is conducted as follows: A buzzer is sounded two 
seconds before a shock is delivered to the dog’s foot through file moccasin 
grid. The resulting shock produces as its dominant reaction a reflex 
lifting of the shocked foot which breaks the circuit, thus terminating 
the shock. No doubt the dog makes many other muscular contractions 
in addition to those which result in the lifting of the foot, but these are 
usually neglected in such experiments; the main point is that the foot^ 
lifting act always does take place at once. After ten minutes the buzzer- 
shock combination is repeated with the same results as before. This is 
continued until fifteen reinforcements have taken place. 

By the principle of primary reinforcement outlined above, the 
receptor discharge {sc) ^ arising from the buzzer vibrations (the 
“conditioned stimulus” or Sc) is temporally conjoined with the foot- 

^The symbols sc, su, and Eu have become (^vaitionalized in the 
ditioned-reflex literature and are here u^d with their conventicmal me^mng. 
The dot been placed above the s in sc in order also to conform to fiie 
usage of the pr^ent work. 
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lifting reaction (the so-called ''unconditioned reaction’^ or and 
this conjunction is followed at once by the termination of the shock 
or need and of the associated drive receptor impulse (s„) which 
constitutes the reinforcing state of affairs. As a result there must 
be set up an increment to the associative connection between the 
afferent receptor impulse produced by the buzzer vibrations and 
the foot-lifting reaction. Thus after a suflBcient number of repeti- 
tions there must arise the new superthreshold receptor-effector con- 
nection, Sc and the dog begins regularly to lift his foot 

promptly at the sound of the buzzer. 

Quite naturally, exactly as in the preceding experiments, there 
is also set up a connection ^between the various static stimuli aris- 
ing from the apparatus (Sa) and the reaction (Bu) because the 
former also are active in conjunction with the foot-lifting act and 
so become connected along with the so-called conditioned stimulus, 
thus: Sa As a result of this connection the dog will fre- 

quently lift his foot when the buzzer is not sounding, just as the 
rat would sometimes leap the barrier when neither the buzzer nor 
the shock was acting. In the case of the dog this unadaptive be- 
havior is discouraged by the spring which tends to hold down the 
footboard. 


THE CONDITIOrm) REFLEX A SPECIAL CASE OF ORDINARY 
LEARlSriNG REINFORCEMENT 

Demonstration Experiment C presents a fairly typical example 
of conditioned-reflex learning, though it is characteristic of the 
school of Bechterev (1) rather than that of Pavlov.^ We have 
already seen that the acquisition of a quite new receptor-effector 
connection, a phenomenon of conditioned-reflex learning, is deduc- 
ible from iiie conditions of Demonstration Experiment 0 on the 
of tim law of primary reinforcement formulated above in con- 
n^ticm witii a typical bit of selective learning (Demonstration 
Ex|^mnent A). Because of the current differences of opinion con- 
^ming the relaticmship between sdective learning and conditioned- 
ieammg, an explicit and somewhat detailed comparison of 
SB typ^ will now be made. In order to facilitate such a 

technique of Pavlov involves certain 
fee of whife must be delayed until the next cbap- 

fmr their adequate understanding will 

mid. 
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comparison, Figure 16 has been constructed to represent in con- 
siderable detail the dynamic factors here conceived to be involved 
in conditioned-reflex learning, in close parallel with the representa- 


BUZZER (S^)- 
APPARATUS (S^) 


DRIVE AND DRIVE- RECEPTOR 
IMPULSE FROM SHOCK (D.sJ 


INJURY TO TISSUE (NEED) 


ELECTRIC CHARGE ON 
MOCCASIN GRID (Sj))- 


■'c 

\ 

\ 

. \ 

S-. -V 





""r,, (lifting of foot) 


Fig. 16. The conditioned-reflex learning of Demonstration Experiment C 
represented according to the primary reinforcement formulation presented 
in the text. The terminates the shock or drive (Sd) which in turn termi- 
nates both the tissue injury and the associated receptor impulses. Presumably 
the latter is the essential constituent of the reinforcing state of affairs which 
sets up the connections so' >Eu and 5c- — 


tion of the process of selective reinforcement of Demonstration 
Experiment A as presented in Figure 13. 

A comparative' study of Figures 13 and 16 verifies explicitly 
what has already been pointed out. There it may be seen that: 

1. Simple selective learning (Figure IS) involve the selection of a 
particular reaction from numerous alternative reactions which are evoked 
more or less at random by the need and the stimulus situation jointly, 
whereas in condition^-reaction learning (Figure 16) there is only one 
(and the same) conspicuous act involved at each reinforcement tri^. 

2. Simple selective learning may involve the mere strengthenii^ of 
receptor-effector connections, already of superthr^mld str^gth before 
the b^inning of the experiment, whereas in conditioned-reflex learning 
there results, typically, a completdy new m^ptor-eff^tor (X^nn^fion. 

From this point of view, Demonstration Experiment B offeiB 
a kind of transition from the extreme of Demonsfa*aticHi Expen- 
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ments A to C, since it is clearly a case of selective learning yet it 
involves the setting up of a receptor-effector connection de novo 
and; at the same time, the presumptive strengthening of other con- 
nections already superthreshold in strength. It must be added 
that conditioned-reaction experiments may also be arranged in 
such a way as to strengthen superthreshold connections and that 
the terminal phases of conditioned-reaction learning inevitably in- 
volve the strengthening of connections set up de novo in the early 
stages of a given learning process. 

These last considerations suggest that the differences between 
the two forms of learning are superficial in nature; i.e., that they 
do not involve the action of fundamentally different principles or 
lawS; but only differences in the conditions under which the prin- 
ciple operates {6, Theorem I). This preliminary impression is 
confirmed when we make a comparison of the two situations (as 
represented by Figures 13 and 16). On one critical point both 
cases are identical — the reinforcing state of affairs in each consists 
in the abolition of the shock injury or need, together with the asso- 
ciated decrement in the drive and drive receptor impulse, at once 

after the temporal conjunction of the af- 
ferent receptor discharge and the reaction. 
This is, of course, all in exact conformity 
with the law of primary reinforcement 
formulated above (p. 71). 

We pass now to the consideration of 
an alternative interpretation of the con- 
ditioned reflex, namely, that held by Pav- 
lov {8)j its greatest exponent. This is 
represented fairly well by Figure 17, a 
diagram frequently employed to illus- 
trate Pavlov's views by American writers 
of elementary textbooks (8, p. 381; 7, 
p. 245). As this diagram suggests, Pav- 
lov agrees substantially with the ^law 
of rsdnforeement” formulated above, as well as with the ^flaw of 
as formulated by Thorndike, in holding that the conditioned 
s&aulus Se (or its trace, Sc) must have approximate temporal con- 
with the imconditioned reaction (J2«) before the receptor- 
^mneetioii can be established. Pavlov differs from the law 
of by rc^rding as the critical element of the rein- 

of affairs the occurrence of Su, in this case the onset 




ft. Ky 

Fig. 17. Conventional 
diagram of the dynamics 
of conditioned-reflex leam- 
(x^mmon in American 
textbooks of elementary 
I^’diology. The circle sur- 
romiding Re is to indicate 
that the preexperimental 
of is 

wo^m, unknown, or ncm-ex- 
Irtaat. 
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of the shock. On the other hand, the critical element in the rein- 
forcing state of affairs by our own hypothesis is the reduction in 
the drive receptor impulse (sp or Su) which accompanies the reduc- 
tion of the need, i.e., reduction of the physiological injury to the 
tissue of the feet, caused by the termination of the shock. 

It is an easy matter to show the inadequacy of Pavlovas formu- 
lation as a general theory of learning by applying it to the case 
of simple selective learning presented by Demonstration Experi- 
ment A (Figure 13). According to Pavlovas hypothesis, every one 
of the false reactions Rt, R 2 , Rsj etc., should have been reinforced 
just as should R^, because (in the conditioned-reflex terminology) 
each is evoked by the unconditioned stimulus (Sw). In that case 
if any selection at all could be expected to occur that reaction 
which takes place the most frequently would be the one selected 
rather than the reaction which actually would set in motion a 
causal sequence leading to the termination of the need. Yet innu- 
merable experiments show that, other things equal, the reaction 
which is followed at once by a diminution in a primary need will 
be selected, regardless of its original frequency of occurrence. 

It is not diflScult to xmderstand how Pavlov could have made 
such an error. His mistaken induction was presumably due in part 
to the exceedingly limited type of experiment which he employed. 
Within the range of his restricted procedures his formulation was 
consistent with all of the facts and observed relationships. Had 
he worked even a little with simple selective learning he would 
doubtless have seen his error and corrected it. Actually he seems 
not to have occupied himself greatly with the problem of the exact 
nature of the reinforcing state of affairs; he was interested mainly 
in discovering the detailed characteristics of conditioned-reflex phe- 
nomena by an intensive experimental attack. In this he was 
eminently successful. 


SUMMARY 

The infinitely varied and unpredictable situations of need in 
which the higher organisms find themselv^ make any form of 
ready-made receptor-effector connections inadequate for optimal* 
probability of survival. This natural defect of inheritei reaction 
tendencies, howevW varied, is remedied by learning. Learning 
turns out upon analysis to be either a case of the diffei^tial 
i^rengtiiening of one from a number of more or 1^ distinet r^c- 
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tions evoked by a situation of need, or the formation of receptor- 
effector connections de nx)vo; the first occurs typically in simple 
selective learning and the second, in conditioned-reflex learning. 
A mixed case is found in which new receptor-effector connections 
are set up at the same time that selective learning is taking place. 

An inductive comparison of these superficially rather divergent 
forms of learning shows one common principle running through 
them all. This we shall call the law of 'primary reinforcement It 
is as follows: Whenever an effector activity occurs in. temporal 
contiguity with the afferent impulse^ or the perseverative trace of 
such an impulse, resulting from the impact of a stimulus energy 
upon a receptor^ and this conjunction is closely associated in time 
with the diminution in the receptor discharge characteristic of a 
need, there will result an increment to the tendency for that stim- 
ulus on subsequent occasions to evoke that reaction. From this 
principle it is possible to derive both the differential receptor- 
effector strengthening of simple selective learning and the acquisi- 
tion of quite new receptor-effector connections, characteristic of 
conditioned-reflex learning as well as of certain forms of selective 
learning. Pavlov puts forward the alternative hypothesis that the 
critical element in the reinforcing state of affairs is the occurrence 
of the unconditioned stimulus. This formulation fits conditioned- 
reflex phenomena but breaks down when applied to selective learn- 
ing situations, a fact which shows it to be an inadequate inductive 
generalization. Fortunately the inadequacy of this interpretational 
detail of Pavlov's work in no way detracts from the scientific value 
of the great ma^ of empirical findings produced by his laboratory. 

NOTES 

Is the Ranforcing State of Affairs in Leammg Necessarily the ''Effect’' 
of the Act Being Reinforced? 

It alimdy beea ^ggested that the hypothesis as to the reinforcing state 
<£ afeJbcs adopted in the present work is distinctly related to that of Thorndike^s 
of *Iliomdike seems to have coined this expression because the 

ftate of affairs which has been found empirically to be necessary in order to pro- 
ranforcements, as in Demonstration Experiments A, B, and C, 
■aiwter ofdinaiy circumstances comes literally as the effect of the reaction which is 
jfiBl^oed. * 11 ^ ^ttise-andreffeet relatioDship is shown explicitly by means of 
Bjrtmm feeding from the act reinforced, e.g., E 4 in Figures 13 
14^ peAeatiy termination of the injurious action to the foot 

A strMy parallel though slightly more 
^upe 16, where taminates the current 
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on the grid by interrupting the circuit and this, in turn, terminates the shock 
to the foot and at the same time brings to an end the receptor impulses (s J arising 
from the current passing through the receptor organs buried in the tissue. 

At first thought it might be supposed that since only consistent reinforcement 
will set up stable habits, and since the reaction (R) when in a given situation 
yielding S to the receptors will be followed consistently by a reinforcing state of 
affairs only when there is a causal connection between the antecedent events and 
those which follow, the “law of effect^' would be established on a firm a priori 
foundation. In point of fact, however, this rule breaks down in the Pavlovian 
conditioned-reflex experiment where the salivary reaction of the dog can by no 
stretch of the imagination be regarded as the cause of the receipt of food which 
reduces the hunger and is commonly considered the reinforcing agent in this 
experiment. This paradox is explained by observing that some comTimt cause, 
the food, produces first the salivation and, later, the reduction in the need 
(hunger). Accordingly the reinforcing state of affairs is temporally related to 
the reaction involved in the reinforcement in strict accordance with the law-of- 
reinforcement formulation, but not as the effect of the reaction being reinforced. 
A second presumptive exception to Thorndike’s formulation of the nature of the 
reinforcement process is the conditioned kneejerk (10). The termination of the 
receptor discharge from the slightly injurious blow on the patellar tendon occurs 
because of the brief duration of the impact of the hammer, rather than because 
of the occurrence of the kneejerk. 

Despite these minor exceptions, Thorndike’s inductive generalization, as 
represented by the expression, “law of effect,” is based upon a very penetrating 
bit of scientific insight into the dynamics of adaptive situations in general- Never- 
theless the exception is probably genuine and it has seemed best to employ in 
the present work the slightly more appropriate though less colorful expression, 
law oj reinforcement. 

What Is the Critical Factor in Primary Reinforcement? 

In Figures 13, 14, and 16 it will have been observed that reductions in (1) the 
need and (2) the receptor response to the need both follow as consequences of the 
act involved in the reinforcement process, direetiy in Figures 13 and 14 and 
indirectly in Figure 16. These considerations raise the question as to which of 
the two is to be r^arded as the critical reinforcing agent; tins can be detamuned 
only when some radical experiment is performed in which one of the two is elimi- 
nated and the other remains active. The writer is aware of no critical evidence 
of tins kind. Until such becomes available the issue must remain uncertain. 
Meanwhile, in the interest of definiteness, the alternative of reduction in drive- 
receptor response is chosen for use in the present work as the more probable of 
the two. Should critical evident^ later prove this choice to be in error, a correc- 
tion can be made. In the preseirij st^e of our ignorance r^ardii^ behavior 
dynamic^^an error in either direction would 'not seem to have such far-reaching 
Eystematic implications as to render correction unduly diflieult. 

Tbe Effectiveness of Reinforcement and the Inimaty of tiie Need Involve 
in the Reinforcement 

A recent study by Finan (4) tends to support tibe view that reduciicm of ikeed 
h a critical factor in the primary rdmforcement larocess. Hife investigator 
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trained groups of albino rats to secure pellets of food by depressing a small bar in a 
Skinner-EUson apparatus (see pp. 87, 268). Each group of animals received 
the same number of reinforcements but in a different condition of food privation. 
Two days later, after the food need had been equalized, all groups were extin- 
guished. The median number of non-reinforced reactions required to produce 
a constant degree of experimental extinction were as follows : 

Hours of food privation during reinforcement 1 12 24 48 

Median number reactions to produce extinction 25 57.5 40 41 

While the above extinction values indicate that the relationship is not a simple 
increasing function of the number of hours' food privation at the time of rein- 
forcement, they do show that for some hours after satiation there is a progressive 
increase in the effectiveness of reinforcement; thus the two phenomena are shown 
definitely to be connected. 


The Onset, Versus the Termination, of Need-Receptor Impulse as the 
Critical Primary Reinforcing Factor 


The view has been put forward in the preceding pages that the termination of 
need-receptor impulse is the critical factor in the primary reinforcement process. 
Many students in this field, however, have held the view that reinforcement is 
critically associated with the onset of the need or drive, as represented by the 
phyaological shock in Demonstration Experiment C. 

The e'^dence from innumerable selective learning experiments, as t 5 rpified by 
Demonstration Experiments A and B, leaves little doubt as to the soundness of 
the need-reduction generalization. This does not necessarily mean that the 
need-onset hypothecs is false; there may be more than one mechanism of rein- 
foK^ment. While such seeming improvidence in biological economy appears 
somewhat opposed to the principle of parsimony, it is not without parallel in other 
** fidds; most organisms possess more than one means of excretion, and some 
oi^misms poss^ more than one independent means of reproduction. Such 
general considerations merely pose the question and warn us of multiple possi- 
biliti^; they cannot be decisive. 

Turning to experimental evidence now availableVe find that selective learning 
of the iype shown in Demonstration Experiments A and B usually yields results 
cxmristent only with the termination hypothesis. On the other hand, the results 
firom (xmditioib^resflex experiment, typified by Demonstration Experiment C, 
are ccK^stent with either hy^thesis. This ambiguity probably arises from the 
ImW duration of the shock usual in such experiments; the onset and termination 
oi occur so close together that it is difficult clearly to distinguish the 

mflneiMse of each. For example, it is quite possible that the critical reinforcing 
in tiie <x)nditioned kneejerk experiment {10) often cited in this connection 
(5, p. 85) may be the termination of the receptor discharge resulting from the 
isiaBy ratiter severe blow on the patellar tendon, which is the unconditioned 
to evoke this reaction. The matter is complicated still further by 


te n aan subjeel^ are ^^ployed in the investigations. Thus the only 
imm available seems to favor the reduction or termination 
from carefully designed and executed 
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c.q)erimente perliaps involving surgical interference with portions of the nervous 
system. 

Intheinterim we shall proceed on the positive assumption that the termination 
of the need (or of its closely correlated receptor response) is a primary reinforcing 
factor; this hardly seems open to doubt. Even if the onset of the need, or of the 
correlated receptor response, proves to have genuine reinforcmg capacity, the 
dynamics of behavior are such that it would not have much adaptive value. 
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CHAPTER VII 


The Acquisition of Receptor-Effector Connections — 
Secondary Reinforcement 

The sequences of learned behavior considered in the last chap- 
ter were all very short when perfected, only two or three seconds 
at the most being required for their execution. These examples 
were chosen not because they were especially typical of mammalian 
behavior in general, but because they were relatively simple and 
so lent themselves readily to an introductory exposition of the 
principles of learning. We must now explicitly recognize the fact, 
confirmed by universal observation of the everyday behavior of 
ani mals including ourselves, that a great deal of behavior takes 
place in relatively protracted sequences in which primary reinforce- 
ment normally occurs only after the final act. Evidence will be 
presented in a subsequent chapter (p. 139 ff.) showing that rein- 
forcemait probably must follow a receptor-effector conjunction 
isCs) within about twenty seconds if it is to have an appreciable 
effect Consequently direct or primary reinforcement, as such, is 
inadequate to account for a very great deal of learning. Fortunately 
an ingenious series of esperiments performed in Pavlov’s laboratory 
in Petrograd has 3 delded a principle which explains these more re- 
mote reinforcements. This supplementary reinforcement principle 
is called secondary reinforcement. In the present chapter we shall 
consida: the nature, ori^, and elementary functioning of this 
extremely important principle. 

I^MONSmmOlT OF THE ESaSTENCB OF SECOIIDART 
BBINPOBCBMENT 

Becau% the principle of secondary reinforcement was first iso- 
latoi frmn the results of conditioned-reflex experiments, we shall 
b^jn OTT |si^entation with an illustrative example from Pavlov’s 
iabaratory. The expemnent was performed by Dr. Frolov; the 
only amount of this expaiment available to English readers is 
that of Pavlov (7, p. 34), who unfortunately omitted many of the 

84 
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details necessary to an introductory account of this type of experi- 
mentation. For the benefit of those unfamiliar with the method- 
ologies of conditioned-reflex laboratories, these accessory details 
are here supplied from the accounts of other relevant experiments. 

Dr. Frolov experimented with a dog, one of whose salivary 
glands had been diverted surgically so that the saliva discharged 
through a fistula in the side of the animaFs face instead of flowing 
into its mouth. Suitable apparatus was provided for the precise 
determination of the number of drops secreted within a given time 
interval. When hungry this dog would be presented with the tick- 
ing of a metronome for a minute or so, and after 30 seconds meat 
powder would (presumably) be blown into its mouth; the powder 
would then be eaten by the dog, a considerable quantity of saliva 
evoked by the incidental gustatory stimulation and chewing activ- 
ity at the same time flowing from the fistula. After numerous 
reinforcements of this kind it was found that the metronome acting 
alone for 30 seconds evoked 13.5 drops of saliva; this is an ordinary 
or ^^first-ordeF^ conditioned reflex. The above account presents a 
fairly typical picture of conditioned-reflex learning by the Pav- 
lovian technique. 

Next, a black square was presented in the dog^s line of vision 
for the first time; no saliva flowed from the fistula during this 
stimulation. Following this test the black square was held in front 
of the dog for 10 seconds, and after an interval of 15 seconds the 
metronome was sounded for 30 seconds, no food being given. The 
tenth presentation of the black square (alone) lasted 25 seconds; 
during this period 5.5 drops of saliva were secreted. This is an 
example of a ^^higher-order'^ conditioned reflex. 

The conditions of Frolovas experiment show that the visual 
stimulation resulting from the pr^entation of the black square 
had in some way acquired from association with the metronome 
stimulation the capacity to evoke the salivary secretion independ- 
ently. Since the presentation and consumption of food were not 
associated with the acquisition of the second conditioned reaction, 
it is assumed that during the original conditioning process the 
metronome had not only acquired the capacity to evoke the flow 
of saliva but had also acquired the capacity itself to act as a rein- 
forcing agent. The metronome is accordin^y said to be a secondary 
reinforcing agent. For analogous reasons the i^ulring r^:^pto- 

effector connection (black square salivation) set up by this 

m^ns is said to be a se&md-order conditioned reaction. 
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SOME PROBLEMS CONCERNING SECONDARY REINFORCEMENT 

Frolov's experiment demonstrates in an unambiguous manner 
the genuineness of secondary reinforcement, a first-rate scientific 
achievement. Unfortunately, even when considered together with 
the other Russian experiments in this field it leaves unanswered 
numerous questions concerning the conditions necessary and suf- 
ficient for secondary reinforcement to occur. 

1. The reaction conditioned to the black square was qualitatively the 
same as, though weaker than, that previously conditioned to the metro- 
nome. Is this typical of secondary reinforcement, i.e., is secondary rein- 
forcement confined to the transfer of the same reaction from one stimulus 
to another, or may any receptor-effector conjunction be connected by 
secondary reinforcement? 

2. In Frolov's experiment the metronome purports to have served a 
double function: (a) that of evokiny the reaction (salivation) which was 
secondarily conditioned; and (b) that of reinfoTcing the conjunction of 
the salivation thus evoked and the receptor discharge produced by the 
presentation of the black square. Is this apparent duplication of function 
by the metronome genuine and, if so, is it a characteristic or necessary 
part of the secondary reinforcement process? 

3. The receptor-effector conjunction involved in the setting up of 
tl^ second-order conditioned reflex was associated temporally not only 
mth. the stimulation of the ticking metronome, but also with the evoca- 
tion of the reaction already conditioned to the latter. This leads us to 
ask: Are both of these events, i.e., both the presentation of the metronome 
stamulus and the evocation of the reaction conditioned to it at the time 
it acquired the power of being a secondary remforcing agent, necessary 
for the secondary reinforcement to occur, or is only one of these events 
nece^iy for secondary reinforcement, and if only one, which one? 


These and a number of other questions of a somewhat similar 
nature arising from the Russian experiments in this field must be 
examined. Before doing this, however, one or two general remarks 
may be made concerning the situation as a whole. To a certain 
^nt th^ questions arise because of the distinctly artificial 
nature of the conditioned-reflex experimental procedure. While in 
no way detracting from the scientific significance of these investi- 
gatotts, iheir artificiality probably does in some cases interfere 
wr e recogmtion of their bearing on the adaptive dynamics of 
life situations.. Accordingly, while considering the prob- 
“ attempt will be made gradually to place the 
J^of ^Mmdary reinforcement in a more natural functional 
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WHAT REACTIONS MAY BE SECONDARILY REINFORCED? 

We shall begin with the first of the above questions — ^whether 
the reaction involved in secondary reinforcement must necessarily 
be the same as that already conditioned to the secondary reinforc- 
ing stimulus, or whether any reaction whatever may be so connected. 
In order to find an answer to this question it will be necessary to 
consider some experiments which differ considerably from the type 
employed by Pavlov. The first of these has been reported by Skin- 
ner { 9 , p. 82) . 

This investigator fed a hxmgry rat tiny pellets of specially pre- 
pared food by activating a food magazine which dropped one pellet 
into a food cup at each activation. On each occasion the action 
of the magazine produced a clearly audible sound vibration. In 
the course of such training rats soon learn to interrupt whatever 
they are doing when the magazine vibration occurs, go directly to 
the cup, and eat the pellet. After 60 pellets had been given in this 
way, the food was removed from the magazine and the rat left to 
itself. 

A horizontal brass bar projected from the wall of the apparatus 
several centimeters above the food cup. In its explorations around 
the food cup the still hungry rat was almost certain sooner or later 
to stand up on its hind legs and rest its front paws on this bar, 
which was so delicately pivoted that even a light downward pres- 
sure would depress it. Moreover, the bar was so connected with 
the food magazine that this downward movement would activate 
the food-release mechanism with its characteristic sound vibration. 
Because of the preceding training this vibration would at once cause 
the animal to search in the cup for a pellet; however, no pellet 
would be found, because in this phase of the experiment the food 
magazine was empty. This being the case it might be supposed 
that the act of pressing the bar would not be reinforced. 

Skinner ran four rats through this experiment and is of the 
opinion that the click really did reinforce the bar-pressing act as 
contrasted with innumerable other acts evoked by the mtuation. A 
record made by one of these animals is reproduced as Figure 18. 
Each small unit in the rise of this curve repr^ents one operation 
of the lever by the rat. It may be seen that the reactions, as shown 
by the slope of the curve, occurred with about maximum frequency 
at the outeet of the process and thoi gradually cea^d as the curve 
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became horizontal. Since no food was given, this learning is pre- 
sumably the result of secondary reinforcement. 

Unfortunately, Skinner publishes no account of an appropriate 
control experiment to show how frequently a comparable animal 
would depress the bar if the sound of the action of the magazine 
had also been eliminated. Nevertheless, much corroborative evi- 
dence from other investigations supports Skinner^s view that gen- 
uine learning took place. 
One such investigation is 
reported by Bugelski (i), 
Bugelski performed an 
experiment in which he 
trained two comparable 
groups, each of 32 albino 
rats, to press a bar for a 
food-pellet reward in an 
apparatus much like that of 
Skinner. At the completion 
of training, the bar-pressing 
habits of both groups of 
animals were extinguished ^ by so adjusting the apparatus that 
pressure on the bar was no longer followed by the delivery of the 
food pellet. With one group, however, the. depression of the bar 
was followed at once by the customary click of the food-release 
mechanism, but with the other group it was not. Bugelski found 
that the click-extinction group, as a whole, executed a little over 
30 per cent more pressures on the bar before reaching extinction 
than did the non-click group. This indicates in a convincing man- 
ner the power of a stimulus (the magazine click) closely associated 
with the receipt of food to contribute to the maintenance of a 
r^eptor-effector connection at a superthreshold level. 

The Sdnner and Bugelski studies, taken jointly, enable us to 
^iswer the first two of the questions raised by the Russian secon- 
dary-reinforeem^t experiments. On the analogy of the Pavlovian 
type of <x>iiditioned-r€flex experiment, the vibrations of the food- 
relea^ apparatus in Skinner^s experimeit may be assumed to have 
conditioned to salivary secretion and other phases of the 

* Hie of mil be taken up in detail in a later chapter 

For prt^at it may merely be said that when a learned 
m and the evocation is not followed by reinforce- 

gratfemlly loses its capacity to evoke the reaction. This 
K»i m immm m &Umctkm. 



Fig. 18. Record of secondary-reinforce- 
ment learning - in which the act learned 
(prei^ure on a bar) was distinct from that 
primarily conditioned (putting head down 
to food cup, salivating, etc.). (After Skin- 
ner, 3, p. 83.) 
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eating process, as, clearly, was the tendency to put the head down 
to the cup and search for the food, by 'primary reinforcement.^ On 
the other hand, the act secondarily reinforced in the Skinner experi- 
ment was that of depressing a bar. Two acts could hardly be more 
diverse than sniffing in a food cup and depressing a bar with the 
paws. We conclude, then, that the identity of the act in the pri- 
mary and the secondary reinforcement of the Russian experiments 
was an accidental condition and that such an identity is not a 
necessary characteristic of secondary reinforcement in general. 
Thus the first of the above questions is answered in the negative. 

The two experiments just examined also enable us to answer 
the second question posed by the Frolov experiment. Since the act 
secondarily conditioned was clearly different from that conditioned 
to the secondary reinforcing stimulus, it follows that the double 
role of evoking the reaction to be conditioned and at the same time 
serving as a reinforcing agent is not necessarily characteristic of 
secondary reinforcing agents in general. 

MUST THE SECONDARY REINTOBCING STIMULUS EVOKE ITS 

coisrumoN-ED REAcnoisr m order to act as 
A REIEFORdEG AGENT? 

The finding of an answer to the third of our questions involves 
a determination of the differential causal efficacy in secondary rein- 
forcement of two events which usually occur together, namely, the 
so-called secondary-reinforcing stimulus and its conditioned reac- 
tion. The problem could be solved readily enough if we could 
devise some way of presenting in a situation known to be capable 
of reinforcement each of the factors separately, and oteerving 
whether or not learning does in fact occur in each case. The diffi- 
culty lies primarily in accomplishing the complete and certain 
elimination of the one factor while retaining the complete integrity 
of the other. While no investigations have been found which were 
deliberately directed to a solution of this problem, there are two 
or three which throw indirect light upon it. One of th^, reported 
by Cowles (S) , purports to have eliminated from the secondary 
reinforcement situation at least the gross overt reaction conditioned 
to the secondary reinforcing stimulus. 

Cowles readily trained two chimpanzee to insert l%.-inch col- 
ored didm into a slot machine which ddivered a raisin for each 

^ However, see terminal note entitled, **The Ste^tus of Food Reward as a 
Rdnfordng Agent.^^ 
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disk inserted. After this training the animals would retain, hoard, 
and even expend considerable amounts of energy to secure the disks, 
evidently as subgoals. Later each animal was presented with the 
task of learning which of a row of five small, lidded boxes con- 
tained concealed within it one of these tokens. Each learning 
session consisted of 20 trials, all of which had to be completed 
before the animal could exchange its tokens for raisins at the slot 
machine which was located in a room about 35 feet distant. Each 
series of 20 trials was devoted to a separate box, so that old habits 
of choosing other boxes on previous training series had to be broken, 
and new ones substituted, at every succeeding session. Finally on 
alternate sessions, in order to secure a measure of the relative 
strength of primary and secondary reinforcement, instead of a 
token real food in the form of a raisin was put in the box. It was 
found that the average score of the two apes on the second half 
of each 20 trials on a given box (where chance success alone would 
yield 20 per cent correct choices) was 74 per cent for food tokens 
and 93 per cent for the food. This shows an amount of learning 
due to secondary reinforcement which is fairly comparable to that 
mediated by the primary food reward itself. 

In this experiment the overt act normally evoked by the sec- 
ondary reinforcing agent, the food token, was that of inserting the 
token in the slot machine, whereas the act involved in the process 
of secondary reinforcement was that of lifting the lid of a par- 
ticular box in a row of five. Under the conditions of this multiple- 
choice learning the slot machine was in a different room, and the 
animals literally could not carry out the act of inserting the token 
for some time after obtaining it. Moreover, Cowles reported no 
tendency on the part of the apes to execute movements such as to 
insert the disks into an imaginary vending machine. In so far 
the Cowl^ experiment seems to yield a negative answer to the 
third query su^^ted by Frolov’s experiment; i.e., it suggests that 
the occurrence of the first-order conditioned stimulus is necessary 
for the setting up of a second-order conditioned reaction, and that 
the occurrence of the primary conditioned reaction is not necessary. 

THE EFIBCT OF EXUUNCTIOrr OF THE PRIMARY RECEPTOR- 
BFIKTOB OOKNEOTON OK BECO]ST>ARY REIKFORCEMBNT 

the r^ults from Cowl^’ experiment, it would be rash 
to e^mclude at once that the evocation of some fractional compo- 



SECONDARY REINFORCEMENT 


91 


nent of the reaction originally conditioned to the secondary rein- 
forcing stimulus was not present at each secondary reinforcement. 
The principles of reinforcement learning lead a 'priori to the expecta- 
tion that salivation, and probably many other hidden internal 
processes such as the galvanic skin reaction, must have been con- 
ditioned both to the stimulus energies arising from the vending 
machine and to those from the food tokens. Moreover, we saw 
above that in Frolovas experiment salivary secretion accompanied 
every secondary reinforcement of the black square. The fact that 
such processes were not observed in Cowles’ experinient argues little 
against the formidable probability that they did in fact occur. Had 
not special apparatus and procedures been employed, the salivary 
secretion would not have been observed in Frolov’s experiment 
either. On the positive side, Cowles (S) reports that the apes em- 
ployed in his experiments showed a marked tendency to put the 
food tokens in their mouths. Wolfe (IB, p. 16) reports that both 
food and food tokens would elicit anticipatory lip-smacking activ- 
ity, but a brass non-food token of the same shape would not. It 
is clear from the above considerations that further evidence will 
be required to determine whether the presence of the reaction com- 
ponent of the usual secondary reinforcing situation is a necessary 
condition for the occurrence of secondary reinforcement. 

We saw above in connection with the Bugelski experiment (p. 
88, footnote) that receptors which frequently evoke a conditioned 
reaction without accompanying reinforcement presently lose the 
power of evoking this reaction. Experimental extinction accord- 
ingly offers a means of separating a secondary reinforcing stimulus 
from the reaction to which it was conditioned while it was acquir- 
ing its secondary reinforcing powers. This circumstance makes 
possible the presentation of the former without the latter in close 
temporal proximity to a reinforcible receptor-effector conjunction. 
Such a combination of circumstances occurred in a quantitative 
experiment reported by Grindley (5) . 

In one of his experiments Grindley placed young chickens at 
the beginning of a four-foot runway. At the other end were placed 
grains of boiled rice. Half of the chickens were permitted to find 
their way down the runway and eat the rice. The other half were 
likewise permitted to go down the runway, but found a plate of 
glass placed a few inches above the rice which prevented them from 
eating the grains. An index based on the time required by each 
of the two groups of chickens to travel^ the runway on each of 
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the 12 successive trials is shown in Figure 19. Our interest is con- 
fined mainly to the scores of the group of chickens which saw but 
could not eat the rice. This curve shows that for the first four 
or five trials the non-eating chickens gained in speed nearly as 
fast as did the rewarded ones. During the subsequent trials, how- 
ever, the non-rewarded chickens gradually lost their rate of loco- 


motion until at the eleventh and twelfth trials they did scarcely 



better than the chance per- 
formance of the very first 
trial. 

These results of Grind- 
ley’s experiment are dupli- 
cated by Skinner’s record 
(Figure 18), which is so 
constructed that the flatten- 
ing out of his curve to a 
horizontal represents ex- 
perimental extinction after 
what appears to be a rather 
abrupt learning. Pavlov 
reports the same phenom- 
enon in the conditioned- 
reflex learning situation; 
extinction was encountered 
by him especially when he 


Fig. 19. Graplis showir^ in parallel the 
of the learning of young chickens 
to traverse a straight four-foot runway by 
piinaary and by secondary reinforcement 
which was unaccompanied by the original 
mipportmg prima r y reinforcement. (After 
Grindl^, 3, p. 179.) 


attempted to set up third 
and fourth order condi- 
tioned reflexes. 

The experimental results 
just considered, as well as 
those from numerous con- 


cordant experiments of both 
concfilaoii^-reflex and selective learning, all point to the same con- 
clusiony namely, that a secondary reinforcing agent (in Grindley’s 
experim^t ihe visual stimulus pr^ented by the rice grains) loses its 
power of ^ondary reinforcement when it loses its power of evoking 
tte ruction ccmditioned to it at the time it acquired its power of 
^cHwlarj rrinforc^ait; moreover, it would appear to possess the 
of r^formnent cmly to the degree that it possesses the 
ppw of evcteng "Uiis reaeti<m. It follows that in cases in which the 
agent is still strong, secondary reinforcement 
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makes some progress. Very soon, however, it weakens through ex- 
perimental extinction, reinforcement is thereby withdrawn, and the 
secondary reinforcement declines along with the strength of its 
parent receptor-effector connection. 

Granting that the occurrence of at least some component of the 
reaction conditioned to the stimulus in the usual secondary rein- 
forcing situation is necessary for secondary reinforcement to occur, 
our original question is still far from answered. Even if the reac- 
tion is necessary, this does not mean that it is sufficient. This 
confronts us with the question of whether the stimulus also is neces- 
sary. The empirical solution of the latter problem would require 
that the reaction conditioned to the stimulus of a secondary rein- 
forcing situation somehow be presented without the evoking stim- 
ulus, in close temporal proximity to a reinforcible receptor-effector 
conjunction, and that a determination be made as to whether or 
not learning takes place. No experiments in which this was at- 
tempted have been found. Judgment in this intricate but theo- 
retically important matter must accordingly be held in abeyance 
until more adequate evidence becomes available. 


THE POSSIBILITY OF SECONDARY EEINFOECEMEISrTS ABOVE THE 
SECOND AND THIRD ORDERS 

A fourth question raised by the Russian experiments on secon- 
dary reinforcement concerns the possibility of setting up condi- 
tioned reactions above the second order. In discussing this question 
Pavlov remarks (7, pp. 34-35) : 

It was found impossible in the case of alimentary reflexes to pre^ 
the secondary stimulus into our service to help us in the establishment 
of a new conditioned stimulus of the third order. Conditioned reflex^ of 
the third order can however be obtained with the help of the second 
order of conditioned reflexes in defence reactions such as that against 
stimulation of the by a strong electric current. But even in this 
case we cannot proceed further than a conditioned reflex of the third order. 
... In these conditioned reflexes, pa^iog from the first to the third order, 
the latent period progresavely increa^. In the same order we pa^ 
from the strongest to the weakest omditioned d^enc^ idBex. 

The above passage from Pavlov has been quoted at length be- 
cause there is reason to believe that while the facts there roported 
are well authenticated, their adaptive implications have occasion- 
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ally been misunderstood, possibly even by Pavlov himself. This 
presumptive error in interpretation seems to have come about 
through the distinctly narrow and artificial nature of the condi- 
tioned-reflex experimental procedure by means of which it was 
investigated, much as is believed to have been the case in the deter- 
mination of w^hat constitutes a primary reinforcing state of affairs 
(p. 78). In order to prove that secondary reinforcement is a 
genuine phenomenon it is necessary to remove all possibility of 
primary reinforcement from the situation. This, of course, inciden- 
tally causes extinction of the primary receptor-effector connection, 
which, as we saw in Grindley's experiment, soon leads to the loss 
of the power of reinforcement by the secondary reinforcing agent 
and thus tends to bring the potential chain of transfers of the power 
of reinforcement to an early termination. As indications of this 
we note Pavlov’s statement that each successive reaction in the 
chain had a progressively longer latency, itself an indication of 
receptor-effector weakness which is also separately noted by him. 
Viewed in the light of these considerations the limitation in the 
number of higher-order conditionings was presumably a mere arti- 
fact of the technical procedure employed in the investigation of 
the problem. 

We now have evidence which indicates that the gradient of 
reinforcement does not extend backward from the reinforcing state 
of affairs in detectable amounts beyond about 20 seconds (8 ) . This 
means that in protracted behavior sequences, even with primary 
reinforcement fully intact, the direct effects of the latter are auto- 
matically excluded for all receptor-effector conjunctions beyond a 
half minute or so from the point of such reinforcement. Since in 
the higher organisms behavior sequences, e.g., the pursuit of prey, 
often continue far beyond such a limit, it follows that the strength 
of the earlimr segments of such sequences must frequently be main- 
tain^ by very long chains of secondary reinforcing situations. In 
normal human organisms it would appear from such considerations 
that higher-order conditioning yields receptor-effector connections 
which are comparable in vigor with those produced by primary 
reinfomement, and that there is practically no limit to the degree 
to which higher-order conditioning may be carried under suitable 
conditicms. As for ihe latter, it may be said that with the excep- 
tion of tile nature of the reinforcing state of affairs involved the 
^mrnrn i^^ary for secondary reinforcement are the same as 
th^ for primary reinforcem^t. This is to say that a receptor 
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impulse will acquire the power of acting as a reinforcing agent if 
it ocawrs consistently and repeatedly within 20 seconds or so of a 
functionally potent reinforcing state of affairs^ regardless of whether 
the latter is primary or secondary. 


THE BOLE OF SECOHDABY RBIHFORCEMEITT IH COMPOUNI) 
SELECTIVE LEARJ^-ING 

As a means of showing something of the systematic and func- 
tional significance of secondary reinforcement we shall now con- 
sider a few of its more elementary implications. It has already 
been suggested (p. 84) that secondary reinforcement plays its 
major role in protracted behavior sequences, particularly in those 
portions of them which precede the point of primary reinforcement 
by more than 20 seconds or so. A typical sequence of this kind is 
found in compound selective learning. This may be thought of as 
a series of trial-and-error learning situations in which each link 
in the series consists of a simple selective learning situation pos- 
sessing the general characteristics of Demonstration Experiment A. 
It may further be assumed that at the beginning of learning 20 
seconds or more are consumed by the organism before the correct 
reaction is performed at each of five choice points; that the correct 
reaction in one situation always leads at once to the next situation 
in an invariable order; that the new situation instantly activates 
the receptors of the organism in a distinctive way; and that the 
entire sequence finally culminates in the complete satiation of the 
need which motivated the organism throughout the total activity. 
Following common-sense usage, we shall call this final primary 
reinforcing state of affairs the goal and represent it by the letter G, 
If there are five segments in such a behavior sequence, we may 
represent the correct reaction of the first trial-and-error situation 
by Rx, that of the second by Rb, and so on to Rg. In a similar 
manner the gross stimuli presented to the organism by the respec- 
tive situations may be represented by the parallel notation, >Si, 

Ss, Sj^, and /S 5 . 

It follows from the conditions of compound selective learning 
assumed above that the organism by sheer trial-and-error will 

work blindly down through the series until Ss >^s evokes Rst 

which by physical causation produces G, the primary reinforcing 
state of affairs. This, by the principle of primary reinforcement, 
begins to set up the connection Ss > Ss >Rs s-nd at the same 
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time, by the principle of secondary reinforcement, to endow Ss with 
the powers of (indirect) reinforcement. Now, is too remote 
in time from G to be selected by primary reinforcement. However, 
in- the course of subsequent executions of the sequence by the organ- 
ism the secondary reinforcing power recently acquired by will 
mediate the connection Sj, — ^ ^ at the same time endow- 

ing S]^ with secondary reinforcing power. In the same way sec- 
ondary reinforcement will move progressively backward from to 
Ssy from Ss to Sg, and finally from Sz to Si, ultimately resulting in 
the tightly knit, errorless behavior sequence shown in Figure 20. 

It is clear from the preceding that compound selective learning 
must necessarily progress in a backward manner from the point 
of primary reinforcement. This means that in the situation under 
consideration errors would be eliminated most quickly at 85 ^ next 


Fig. fbO. Diagrammatic representation of the final phase of a case of 
compound selective learning in which simple trial-and-error occurs five times. 
If at the ba nnin g of learning the occurrence of each correct R be assumed 
to consume at least 20 seconds then the four segments at the left of the 
figure would be dependent for their selection purely upon secondary rein- 
forcement. In that event St would be the secondary reinforcing agent effect- 
ing the selection of Rt^ and St would be the secondary reinforcing agent 
effecting the selection of Rt. As in Figures 13, 14, and 16, the sign > 
represents a non-phydological causal connection. 


most quickly at and most slowly of all at St, the rate-of-leam- 
ing score at the several points presenting a gradient whose highest 
point would be at Sj. That such a gradient exists in compound 
selective learning situations is well known, though it may be over- 
ridden by various known factors. Because of the relation of this 
gradient to the point of primary reinforcement or goal, the back- 
ward ord^ of the elimination of errors has been called the goal 
p^adient (5). We shall recur to this subject in another connection 
(p. 142). 

ITiere should also be noted the implication in the above analysis 
that certain circumstances in the learning situation may effectively 
prevent learning from occurring, particularly in the segments an- 
terior to 8 s - > Ss — Ms- One of the most important of these is 

a ^lay in fte occurreice of Ss, say, following reaction The 
pri^aples ^vdoped ^rlier in the chapter imply that in an ideal 
hi wMch 8 t, 8 s, 8 ^, and Ss are all totally different, a 

mlmj of m or more will prevent learning at any point in 
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the series anterior to Ss. On the other hand, if Sx, Ssj and 
all contain an important component found in S 5 (a situation easily 
set up experimentally) then each will become a reinforcing agent 
regardless of the temporal gaps in the series. 

SUMMARY 

Primary reinforcement, because the range of its gradient is 
limited to 20 or 30 seconds, is incapable of explaining the learn- 
ing which manifestly occurs when the receptor-effector processes 
involved are temporally very remote from the relevant need reduc- 
tion. These latter reinforcements are explained by the discovery 
that the power of reinforcement may be transmitted to any stimulus 
situation by the consistent and repeated association of such stim- 
ulus situation with the primary reinforcement which is character- 
istic of need reduction. Moreover, after the reinforcement power 
has been transmitted to one hitherto neutral stimulus, it may be 
transferred from this to another neutral stimulus, and so on in a 
chain or series whose length is limited only by the conditions which 
bring about the consistent and repeated associations in question. 
The inability of Pavlov and his pupils to obtain conditioned reac- 
tions above the third order appears to have been due to the highly 
artificial nature of their experimental procedures which did not pro- 
vide the necessary conditions for long and stable secondary-rein- 
forcement chains. 

Our detailed findings concerning secondary reinforcement may 
be listed as follows: 

1. Perhaps the most strijdng characteristic of secondary reinforce- 
ment is that it is itself a kind of by-product of the setting up of a 
receptor-effector connection, in the fimt instance through primary rein- 
forcement. Primary reinforcement, on the other hand, appears to be a 
native, unlearned capacity in some way ai^ociated with need reduction. 

2. Secondary reinforcement may be acquired by a stimulus from 
a^ciation with some previously ^ablished secondary reinforcement, as 
well as with a primary reinforcement. It would appear that transfer of 
Ibis power of reinforcement from one stimulus situation to another may 
go on indefinitely, given the conditions of stable and consistent a^ociation. 

3. A receptor-effector conjunction rDvolving any effector may be 
forced by any secondary reinforcing atuation. 

4. Secondary reinforcement diffeis from primary reinforcement in that 
the former seems to be a^ociaied, at least in a molar ^nse, with stimula- 
tion, whereas the latter seems to be associate with the <^^tion of 
stimulation, Le., of the 

5. Stimuli which acquire ^(mndary r^iforeing power ^em always to 
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acquire at the same time a conditioned tendency to evoke an associated 
reaction. The available evidence indicates that as such stimuli lose 
through extinction the power of evoking this reaction, they lose in about 
the same proportion their power of secondary reinforcement. It is 
probable also that the reverse is true; that a stimulus gradually acquires 
its powers of secondary reinforcement as it acquires the power of evoking 
the reaction conditioned to it. 

6. It follows from (5) that a stimulus alone is ineffective as a secondary 
reinforcing agent. We do not know whether the reaction evoked by thk 
stimulus, or possibly some critical fraction of that reaction if presented 
without the stimulus, would serve as a reinforcing agent. Conceivably, of 
course, the combination of both the stimulus and its conditioned reaction, 
or some fractional component thereof, is necessary to effect secondary 
reinforcement. 

7. It is apparent from the preceding that a reaction conditioned to a 
stimulus which has as a by-product acquired secondary reinforcing power, 
at a point several secondary-reinforcement links removed from the point 
of primary reinforcement, may suffer experimental extinction in two ways: 
(a) through the evocation of the reaction not being followed by the parent 
secondary-reinforcing stimulus, and (6) through the evocation of the 
reaction being followed by the parent stimulus after the latter has lost its 
secondary reinforcing power through, say, the extinction of its own con- 
ditioned reaction. 

8. With the main facts of secondary reinforcement before us we may 
now refonnulate the law of reinforcement in such a way as to include the 
wider learning potentialities inherent in secondary reinforcement: 

Whenever an effector activity occurs in temporal contiguity 
with the afferent impulse, or the perseverative trace of such an 
impulse, resulting from the impact of a stimulus energy upon a 
receptor, and this conjunction is closely associated in time with 
the diminution in the receptor discharge characteristic of a need 
or with a stimulus situation which has been closely and consistently 
associated with such a need diminution, there will result an incre^ 
men! to the tendency for that stimulus to evoke that reaction. 

NOTES 

The Status of Food Reward as a Reinforcing Agent 

We have seen in Ihe preceding p^es that primary reinforcement originates 
in IIbb rednchon of a primary need. Now, the primary need in the case of food 
privaM<m pr^nmahly conrists in the requirement of the cells of the body for 
just as tte primary need in the case of the shocked animfllH in Demon- 
Exp^hnmits A, B, mid C was the c^sation of the action of the electric 
^in^t mt ci tte animals’ feet. It is evident from a knowledge of nutri" 

there fe mi appi^^^ble dday between the beginning of the 
oi food fije ulrimate redaction in the nutrient need of the body. 
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cells while mastication, deglutition, digestion, and absorption are taking place. 
This makes it distinctly improbable that the presentation of food, its mastication, 
and the gustatory stimulus incidental to mastication are primary reinforcing 
states of affairs. Yet innumerable experiments have shown that the presentation 
and mastication of food in fact constitute a powerful reinforcing combination. 
These considerations strongly suggest that the eating of food as such brings about 
learning through secondary reinforcement rather than through primary reinforcement 

Incidentally, this hypothesis explains the paradox that whereas other cases of 
clear primary reinforcement appear to be associated with reduction in drive 
stimulation {Sd sn), food reinforcement appears at the instant of reinforcement 
to be associated rather with stimulation or the onset of stimulation which, as we 
have seen above, is characteristic of secondary reinforcement. This may accoimt 
for Pavlov's view that reinforcement is primarily a phenomenon of stimulation, 

A Possible Technique for Determining the Status of Food Reward as a 

Reinforcing Agent 

Because of the wide use of food reward as a reinforcing agent in behavior 
experiments its status in this respect is in especially urgent need of clarification. 
There is reason to believe that a sham feeding experiment might contribute 
materially to this end. The esophagus of a dog could be severed and both the 
upper and lower sections converted into fistulas opening through the skin of the 
dog’s neck. With such an arrangement it would be possible to feed the dog and 
either have the masticated food fall into a conveniently placed receptacle or have 
it pass through a tube connecting one fistula with the other and thus enter the 
stomach and ultimately reduce the need of the body cells for nutriment. Since 
the various receptor discharges associated with the eating of food, its swallowing, 
digestion, and absorption, have throughout the entire life of each organism been 
associated in a uniform and practically invariable sequence with ultimate need 
reduction, it is to be expected that the stimuli associated with mastication would 
have acquired a profound degree of secondary reinforcing power. For this reason 
the power of secondary reinforcement ought not to be lost by such stimuli througji 
a moderate amount of experimental extinction. Hpwever, if the present hypothe- 
sis is sound, a dog sham fed on one kind of food and really fed on an equally rein- 
forcing kind of food should, after a time, show a distinct preference for the food 
which mediates nutrition and so primary reinforcement; after much training it 
might even refuse to eat the sham food since, not being reinforced, this activity 
should suffer experimental extinction. Careful controls would, of course, need 
to be carried out on such matters as the food preferences pyossessed by the dog 
just before the beginning of the experiment. Numerous variants of the above 
procedure will at once suggest themselv^ as alternatives in the solution of this 
extremely important problem. 

Are Primary and Secondary ReinforcCTaent at Bottom Two TTdngs or One? 

In the present chapter it has been seen that dei^ite wdlFmarked difference 
there are a number of striking amilarities betw^a pnmaiy and secondary re- 
inforcement. Perhaps the most notable of the abnilanties k the fact of r^nfor<^ 
ment itself. So far as our present knowledge goes, tibe iMbat structures mediat^ 
by the two typ^ of reinforcement agents are quahtativdly identicaL Tlik 
consideration alone constitutes a very considerable pn^omptaon in favor of tl» 
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view that both forms are at bottom, i.e., physiologically, the same. It is difficult 
to believe that the processes of organic evolution would generate two entirely 
distinct physiological mechanisms which would yield qualitatively exactly the 
same prciiuct, even though real duplications of other physiological functions are 
known to have evolved. 

While the ultimate proof of the essential identity of the two processes, when 
and if it comes, must be looked for on the physiological rather than on the be- 
havioral level, it is evident that the present development of neurophysiology 
is quite remote from such an achievement. Meanwhile the urgency of the prob- 
lem from the standpoint of systematic behavior theory is such as to make an 
attempt at a workable first approximation to such a proof on a molar level ex- 
tremely desirable. This would necessarily take the form of a revised inductive 
generalization which would be sufficiently comprehensive to include both phe- 
nomena as special cases of the same rule or law. While no detailed and fully 
substantiated hypothesis of this nature will be attempted here, a few suggestions 
are offered which may possibly contribute to the attainment of this end by others. 

In the development of this plan the first important fact to be considered is that 
iJie initial secondary reinforcing stimvkts acquires its reinforcing 'power through a 
process of reinforcement. Moreover, the process of successive transmission of 
this power from one stimulus situation to another, backward through a compound 
selective learning sequence, also appears always to occur in a reinforcement 
situation in which the secondary reinforcing stimulus acquires a reaction tend- 
ency. These considerations suggest rather strongly that the first secondary 
rmnforcing stimulus acquires its power of reinforcement by virtue of having 
conditioned to it some fractional component of the need reduction process of the 
gcml situafion (C?, Figure 20 ) whose occurrence, wherever it takes place, has a specific 
power of reinforcement in a degree proportionate to the intensity of that occurrence. 

Let us represent this|fractional component of the goal reaction by the symbol g. 
Referring back to the compoimd selective learning situation represented in 
Figure 2 ^, it is evident on the above hypothec that while Ss-^SbIS acquiring 
its conn^tion to Rb, the perseverative trace of sb is also acquiring a parallel connec- 

ticm to g. It follows that presently, when the connection Sb^ss acquires a 

superthreshold strength, g, by the principle of stimulus generalization (see p. 
183 ff.), will come forward in the series and occur in close conjunction with Sb 

and win therefore serve to strengthen ^84 — > «4 as well as to condition itself 

to 84 — thus ; 84 «4 > g. In the course of successive trials or repetitions, 

wh^ this connection (s4 ^ g) becomes of superthreshold strength, the same thing 

would omir with 8 $, then with St, and finally with ^1. In this way g, as both, a 
eomfirionahle pro<^^ and a rmnforcing agent, would be passed back through 
s^umaoe. There would therefore develop an underetandable modus operandi 
for compouiKi selecrive learning. 

it takes time for g to bwjome conditioned to a new trace to a degree such 
that it can act with much strength as a reinforcing agent, there is bound to be 
delay betw^n its full action at the godl and that at the beginning 
behavior si^imnce leading to the need reduction. This would, of coui^, 
pfodtaee tibe goal gradi^t, imw known to be charact^istic of such leanied se- 
iW, 11), 

We ffloir to xnc^t^ of e:q)eaimental extinction. Supper, in case 
o^rored at all the five choice points, that the organism 
the ac&m but that the piiniary reinforcing state of 
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affairs, (?, does not follow the performance of Eg. It must be supposed that the 
conditioned g as evoked by Ss — > s& will be weaker than the unconditioned g 
evoked in connection with the need reduction at G, Therefore it is to be expected 

that while Sb-^ss >g will somewhat retard the process of extinction of S5 -~> 

SB > (as in Bugelski's experiment, 1), it will nevertheless not suffice to prevent 

ultimate extinction. In a similar manner the weakening oi ^ Rb will 

rapidly bring about the weakening > g; this will result in a weakening 

oi 84,-^ Si 9 . E 4 , and so on backward throughout the sequence to its very begin- 

ning, gradual collapse of the entire behavior sequence occurring as trial follows 
trial during the extinction process. Presumably generalized extinction effects 
would contribute to the speed of inhibition throughout such a behavior series, 
especially where the trials are given in immediate succession (4, pp. 497-499). 

It is to be noted in this cormection, however, that the present hypothesis 
does not imply that secondary reinforcement will necessarily suffer experimental 
extinction when the support of the primary need reduction is withdrawn. If the 
primaiy reinforcement has been sufficiently profound for the connection Ss 

Sb 9 - ^ to be very strong, the conditioned g maybe intense enough to withstand 

the inhibition generated by an indefinitely large number of otherwise unreinforced 
presentations of &, S4, or Ss. Here, apparently, we have the explanation of what 
Gordon Allport has called the functional autonomy of higher-order conditioned 
reactions; the g^ if sufficiently well conditioned, may be strong enough in rein- 
forcing power to maintain itself through self-reinforcement — a. true functional 
autonomy. It is probable that something of this kind is operative in certain 
cases of neurotic symptoms, as has been pointed out by Mowrer (6) in his be- 
havioristic interpretation of Freud^s doctrine of anxiety. 
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CHAPTER VIII 


The Symbolic Construct as a Function of the 
Number of Reinforcements 

In the course of the preceding discussions of reinforcement the 
reader may have noticed two implicit assumptions: (1) the recep- 
tor-effector connections so set up correspond roughly to what are 
known to common sense as habits;^ and (2) the process of habit 
formation consists of the physiological summation of a series of 
discrete increments, each increment resulting from a distinct recep- 
tor-effector conjunction (sCr) closely associated with a reinforcing 
state of affairs (G). It shall be our task in the present chapter to 
try to tease out of a series of relatively simple and reasonably 
well-studied habit-formation situations at least a first approxima- 
tion to the central law or functional relationship of habit strength 
as dependent upon the number of these reinforcement increments. 

As a preliminary to this undertaking it is important to note 
that habit strength cannot be determined by direct observation, 
since it exists as an organization as yet largely unknown, hidden 
within the complex structure of the nervous system. This means 
that the strength of a receptor-effector connection can be deter- 
mined, i.e., can be observed and measured, only indirectly. There 
are two groups of such observable phenomena associated wdth 
habit: (1) the antecedent conditions which lead to habit formation, 
and (2) the behavior which is the after-effect or consequence of 
these antecedent conditions persisting within the body of the organ- 
ism. As our analysis progresses we shall find that habit strength 
defends upon various antecedent factors in addition to the number 
of reinforcements. We shall also note that habit strength may 
manif^ itself in several different measurable ways. One of these, 
the magnitude of the evoked reaction, will next be considered. 

^>eakmg, by pommon usage the referent of the term "habit'" 
^ a w^i-wotn mode of actum, whereas by the present usage the referent is a 
of the oT^imsm (reailting from the reinforcement) which is 
& but a SRxffcient^ condition for the evocation of the action 
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HABIT STEEl^GTH AISTB REACTIOISr MAGNITUDE 

The progressive increase in the magnitude of an evoked habitual 
reaction with successive reinforcements is conveniently illustrated 
by a study reported by Hovland (f). This investigator associated 
the Tarchanoff galvanic skin reaction, originally evoked by a mild 
electric shock on the wrist, with the simple sinusoidal vibrations of 



Fig. 21. An empirical learning curve plotted in terms of the amplitude in 
millimeters of the first galvanic skin reaction evoked by the conditioned 
stimulus after varying numbers of reinforcements. The circle represent mean 
readings for different but comparable groups of subjects. The curved line 
was plotted from values secured by substituting in an equation fitted to the 
data represented by the circles. (From data published by Hovland, 1, p. 268.) 

a beat-frequency oscillator. The galvanic reaction was picked up 
from the skin of the subject's hand by a pair of polished silver disk 
electrodes, one bound to the palm and the other to the back. The 
current thus secured was passed through a sensitive galvanometer, 
the magnitude of the subjects electrical reaction being shown by 
the amount of movement across a screen made by a beam of light 
reflected from a mirror in the instrument. Tliis movement was 
recorded graphically and was subsequently measured in millimeters. 
Four matched groups of 32 subjects each were given 8, 16, 24, and 
48 reinforcements r^pectively. At once following this series of 
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reinforcements the tone was presented alone, and the amplitude (A) 
of the evoked reaction was measured. The means of the condi- 
.tioned reactions thus evoked from the several groups of subjects 
are shown by the circles in Figure 21. The circle at zero is the^ 
average amplitude of reactions evoked from all groups of subjects 
previous to reinforcement. The curved line drawn among the data 
points represents a series of values calculated from a growth func- 
tion which has been fitted to the data represented by the circles. 

From an inspection of Figure 21 three observations may be 
made: 

1. The value of this reaction at zero reinforcements is not itself zero 
but has, on the contrary, the very appreciable value of 3.16 millimeters. 
This is quite characteristic, since it has long been known that the 
galvanic skin reaction is evocable in considerable amounts by any stimulus 
of appreciable intensity. 

2. The greater the number of reinforcements (and, presumably, the 
stronger the habit), the greater will be the amplitude of the evoked 
reaction. Accordingly, the amplitude of the reaction is said to be an 
increasing function of the number of reinforcements.^* 

3. Despite a certain amount of deviation of the circles from the fitted 
curve, possibly due to the limited number of data from which the means 
were calculated, the relationship appears to approximate rather closely a 
simple positive growth function. 


HABIT STRENGTH AND REACJTION LATENCY 

A second way in which habit strength may manifest itself in a 
measurable manner is in the length of time elapsing from the onset 
of the stimulus to the onset of the associated reaction This 

time interval is called reaction latency. 

The general relationship of habit strength (as indicated by the 
number of reinforcements) to reaction latency is illustrated by an 
iav^kgation reported by Simley (5). In this study college stu- 
dents a^^ociated nonsense characters, presented for five seconds 
each by means of an automatic exposure apparatus, with nonsense 
syllables pr^ented orally by the experimenter in the middle of 
each ^Kpomre. The subjects were instructed to speak each syllable 
JiMt m quickly as possible after the corresponding character was 
A voice connected with other automatic devices 
tte determination of each reaction latency as the 

for reaetioms, such as salivary seeretion 

ai^ me ^vanic sim leadatm, to apparently not for all (see p. 329 ff.). 
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learning progressed. Learning was continued long after the habits 
involved in any given rote series had passed the reaction threshold. 
Out of a large number of such stimulus-response combinations, one 
subject (M.W.) was found 


to have spoken the asso- 
ciated syllable before being 
prompted at the second pre- 
sentation of about 125 of 
the characters. The mean 
latencies of these reactions 
at the second and each of 
the following fifteen rein- 
forcements^ are shown by 
the circles in Figure 22. As 
in the case of Figure 21, a 
function has been fitted to 
these data; this is repre- 
sented by the curve which 
passes among the circles. 

An inspection of Figure 
22 shows: 
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1. There is no circle rep- 
resenting a latency value at 
zero reinforcement; this is be- 
cause no reaction of this com- 
plex type can occur previous 
to any learning. Strictly speak- 
ing, this means that the Ieu- 
tency of such a reaction is 
infinite when reinforcement is 
zero. 

2. Reaction latency is in- 
versely related to the number 


Fig. 22. Empirical learning curve plotted 
in terms of the reaction latency of speaking 
nonsense syllables at the presentation of the 
nonsense character with which they were 
paired. The circles represent means from 
about 125 such syllable reactions by a sin^e 
subject (M.W.) at 16 succ^sive charaetor 
presentations and after as many rmnforce- 
ments. The curved line represents the re- 
ciprocal of a fractional power of a pc^itive 
growth function fitted to the data (see 
equation 5, p. 121). (From results pub- 
liiied by Simley, 5.) 


of reinforcements; i.e., the 

greater the number of reioforcements (and, presumably, the stronger the 
habit), the shorter the time required for reaction evocation. Thus r^crion 
latency is said h) be a decretmng function of the number of leanfoTOe- 


mente.^ 


3. Quite as in F%ure 21, the data values deviate apprecM>Iy fr<m the 


^ Remforcement in sudi li^ming is evideiitly se(x»ndary. (See CSbapter 

vn.) 

^This statement holds te cer^dn eondirions of teainii^ but not to all 
(see p. 337). 
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fitted curve, tliougli in general the deviations are not excessive. The 
relationship thus appears to approximate rather closely the reciprocal of a 
positive growth fimction. 


HABIT STRENGTH AND RESISTANCE TO EXPERIMENTAL 
EXTINCTION 

A third way in which habit strength may manifest itself in a 
measurable manner is in its resistance to the effect of repeated 
evocations unaccompanied by reinforcement, which ordinarily pro- 
duces experimental extinction (see Chapter XV). An illustration 
of this functional relationship is found in a study by Williams (d), 



Fig. 23. Empirical learning curve plotted in terms of resistance to experi- 
mental ^tinction. The circles represent the mean number of unreinforced 
Imr-pre^ing reactions evoked by the conditioned stimulus in different groups 
of hungry albino rats after varying numbers of food reinforcements. The 
^irved line represents a positive growth fxmction which has been fitted to the 
data represented by the circles. (Data from Williams, 6 , and Perm; figure 
adapted from one published by Perin, 4 .) 


^ supplemmted by Perin ( 4 ). These investigators trained groups 
of Imnpy albino rats to depress a bar in order to secure food pellets 
r^^mbling that of Skinner and Bugelski, described 
(pp, 87, 88 ff.), Tlie several groups of animals were given 
numbers of reinfore^ents, after which the reactions were 
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no longer followed by the food reward. The mean number of 
unreinforced reactions (n) which were made by the respective 
groups of animals before an interval of five minutes or more 
occurred between two successive responses was taken as the measure 
of resistance to extinction.^ The relationship of these values to 
the number of reinforcements is shown by the circles in Figure 23. 
The cmwed line running through these data points represents a 
positive growth function fitted to them. 

An inspection of this figure shows: 

1. At zero reinforcements the number of unreinforced reactions re- 
quired to produce extinction is negative. This is a quantitative expression 
of the fact (noted above in connection with reaction latency, p. 105) 
that a habit must have a certain strength before any reaction at all can 
be evoked (see Chapter XVIII), and therefore before any directly measur- 
able extinction effects can possibly be observed. This n^ative value ( — i) 
was obtained indirectly by extrapolating backward to zero reinforcements 
the function fitted to the values obtained from superthreshold strengths 
of the habit. 

2. The greater the number of reinforcements (and, presumably, the 
stronger the habit), the greater will be the number of non-reinforced 
reactions required to produce a given d^ree of experimental extinction. 
Resistance to experimental extinction may therefore be said to be an 
increasing function of the number of reinforcements. 

3. In general the data points, in spite of the usual deviations from 
the fitted curve, approximate rather closely a simple positive growth 
function, as we saw to be the case with reaction amplitude. 


HABIT STRENGTH ANP PER CENT OF CORRECT REACTION 
EVOCATION 

A fourth way in which an increase in habit strength may mani- 
fest itself is by a change in the per cent of occurrences of the 
stimulating situation which evokes the reinforced reaction. This 
functional relationship is illustrated by the results of an unpub- 
lished experiment performed by Bertha lutzi Hull. An albino rat 
was presented with a pivoted rod projecting through a cross-shaped 
aperture in the side of a restraining box. The hollow end of the 
rod was filled with sticky food. In securing this food the animal 
incidentally moved the rod more or lei^ at random into all four 
arms of the cross, but into some much more frequently than others. 
At first the apparatus was set to give the rat a small pellet of 

^ An alternative and clt^ly related measure of this ^me function is the 
time required to produce experimmital extinction (4). 
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food whenever the rod was moved in any one of the four directions. 
After this had determined the relative strength of the animaPs 

reaction tendencies into each 
of the four arms of the cross, 
the end of the rod was care- 
fully cleaned and the appa- 
ratus set to give food pellets 
only when the rod was moved 
in the hitherto least preferred 
direction. Owing to the large 
number of trials required for 
this bit of learning, the ele- 
ment of chance was largely 
eliminated from the results 
and a very smooth curve of 
the process was obtained 
from a single animal. This 
is shown in Figure 24. The 
per cent of correct reactions 
for successive groups of 100 
trials is indicated by the 
circles. Since no equation 
has been fitted to these data, the circles have merely been con- 
nected with straight lines. 

An examination of Figure 24 shows: 

1. The greater the number of trials (and, presumably, the greater 
the relative habit strength of the reinforced reaction), the greater will 
be the per cent of correct reactions. 

2. The curve b^ins with a relatively brief period of positive accelera- 
tio n . 

3. The i^rbd of p<^tive acceleration is succeeded by a relatively pro- 
tracted period of native accelerarion. This had not reached the maxi- 
mum of 100 cent correct reactions when the experiment was 
teminated. 

4. The cKHubinarion of the pemtively and the n^atively accelerated 
jK^rlions of the l^tming curve gives it a definitely sigmoid shape which 

strika^y different from Ihat of lie three learning curves previously 

TBM COH(»T OF HABIT STRENGTH AS StTCH 

We have exanplified fc«ir cas^ of relatively simple habit 
fcmaaticHL In aH it has been assxnned that habit 
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Fig. 24. Empirical learning curve 
plotted in terms of the per cent of cor- 
rect (reinforced) reactions by an albino 
rat from a group of four possible direc- 
tions of movement of a rod projecting 
through a cross-shaped aperture. (Plotted 
from an unpublished study by Bertha 
lutzi Hull.) 
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strength has progressively increased with the number of reinforce- 
ments. On this assumption, the progressive increase of habit 
strength in each of the four learning situations is manifested as 
a distinct measurable function of the number of reinforcements. 
While no habits manifest themselves in all of the four manners at 
every point of the acquisition process, most do so at one stage 
or another. No habits mediate overt action below the reaction 
threshold (p. 323 ff.) . On the other hand, all well-established habits 
display on the presentation of the relevant stimulus a certain mag- 
nitude of reaction, a certain reaction latency, and a certain resist- 
ance to experimental extinction. Moreover, many habits at or near 
the reaction threshold (see Chapter XVIII), as well as all processes 
of selective learning at medium strengths, display a progressive 
increase in the frequency with vrhich the stimulus evokes the reac- 
tion being reinforced. Indeed, certain habits, e.g., the conditioned 
lid reaction, may manifest themselves in all four ways at some 
particular stage of their formation. 

From the above considerations, as w^ell as from everyday obser- 
vation, it is clear that at any time for months and years following 
specifiable reinforcement the presentation of the conditioned stim- 
ulus is likely to evoke the reaction. Now, it is assumed that the 
immediate causes of an event must be active at the time the event 
begins to occur. But at the time a habit action is exuked, the 
reinforcing event may be long past, i.e., it may no longer exist; 
and something which does not exist can scarcely be the cause of 
anything. Therefore, reinforcement can hardly be the direct or 
immediate cause of an act. We accordingly conclude that the 
immediate cause of habit-mediated action evocation must be a com- 
bination of (1) the stimulus event and (2) a relatively permanent 
condition or organization left by the reinforcement within the 
nervuus system of the animal. This last is what is meant by the 
term habit It will evidently be a decided convenience to speak 
of this persisting physical condition as distinct from either the 
reinforcing events which produced it or the overt activities w-hich, 
on occasion, it may itself mediate. 

In this connection it may be recalled from the prec^iing pages 
that the after-effects of reinforcement manif^t themselves in mul- 
tiple mod^. From a positivistic point of view, the question natu- 
rally arises as to which of these, if any, is to be considered the 
index of habit strength. The fact s^ms to be that no one of them 



no 


PRINCIPLES OF BEHAVIOR 


merits this distinction more than the others. A careful survey of 
the evidence has led to the belief that while habit strength is the 
dominant factor determining the amount of each of the four aspects 
of reaction due to reinforcement after-effects, the latter are acting 
in conjunction with some other and distinct factors in each case. 

What those factors may be, experimentalists are at present 
busily engaged in finding out, but some things are already known. 
It is a matter of common observation that we learn by trial and 
error specifically to react violently, moderately, or gently as a 
given situation requires in order that reinforcement shall occur 
(p. 304 ff.) . Thus magnitude of reaction may be learned as such. 
Also, Pavlov (3) long ago showed that reactions could be condi- 
tioned to various delays by suitable methods of training. Thus 
latency also may be learned as such. Finally, whether or not a 
reaction will be evoked by a given stimulus apparently depends not 
only upon the strength of the original reinforcement but upon the 
other stimuli which may be acting at about the same time; upon 
the strength of any competing reaction tendencies associated with 
the other stimuli; and upon a very large number of additional fac- 
tors, some of which will be elaborated in subsequent chapters (p. 
341 ff.). In the calculation of the magnitudes of these various 
manifestations as joint functions of habit strength it will clearly 
be a convenience to have a single value to represent the influence 
of the several factors which act together to determine its amount. 

It is quite possible, of course, that a theory of behavior could 
be developed without employing the concept of habit strength. In- 
deed, it is probable that all of the many constructs (such as elec- 
trons, protons, etc.) which are employed to represent unobservables 
in the physical sciences could be dispensed with if scientists cared 
to resort to the use of expressions sufficiently complex to represent 
explicitly all the observations from which the nature and amounts 
of the unot^ervable entities are inferred. When the relevant ob- 
^rvatioDS upon which the presence and magnitude of the unobserv- 
able caotity depend are properly represented by a single number 
or sign, they can all be manipulated at once quite adequately by 
tiie manipulation of that symbol. Indeed, this is a routine practice 
in ma1diemati(^, where such conditions obtain. To repeat every 
cue of them cm each occasion in which the group of factors as a 
whole m to be manipulated would be a pedantic waste of effort. 
On tile otter hand, if the system has been properly constructed the 
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sign can at any time be expanded by an explicit representation of 
the various factors for which it stands. This manoeuvre^ of course, 
converts the system substantially into positivistic form; which 
shows that fundamentally the use of constructs, where permissible 
(see terminal note), is no different than an ordinary positivistic 
procedure such as that advocated by Woodrow (7). 

The use of logical constructs thus probably in all cases comes 
down to a matter of convenience in thinking, i.e., an economy in 
the manipulation of symbols. It is accordingly on the ground of 
convenience and economy rather than of strict necessity that the 
attempt is here being made to retain the substance of the common- 
sense notion of habit strength. Students of behavior who have a 
positivistic distaste for logical constructs may adapt the present 
systematic approach to their own preferred mode of thinking merely 
by recalling explicitly the various antecedent factors which deter- 
mine the quantitative value of any given construct which offends 
them, each time it is encountered. 

THE SYMBOLIC REPRi^ENTATIOH OP HABIT STRE^STGTH 

From the foregoing it is evident that the chief advantage to 
be expected from the employment of the logical construct habit 
strength arises from economies in thought, i.e., in symbolic manip- 
ulation. In order to realize this advantage an appropriate sym- 
bolism must be devised. The reader will be aided in the under- 
standing of this appropriateness if he will recall a few relevant 
relationships presented earlier. Specifically it will be well for 
him to remember that the process of reinforcement sets up a con- 
nection in the nervous system whereby an afferent receptor dis- 
charge (s) originally involved in a reinforcement is able to initiate 
the efferent discharge (r) also involved in the reinforcement. But 
since the afferent ^discharge (s) is initiated by the action of a 
stimulus energy (S) on the receptor, and since the efferent dis- 
charge (r) in due course enters the effector system, producing a 
reaction (J?), we have the sequence, 



The habit organization is represented by the arrow with broken 
shaft between the neural processes s and r. If we replace this 
arrow as a representation of habit with the more convenient and 
somewhat more appropriate letter H, we have the full and explicit 
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notation for expressing the various relationships involved in the 
concept of habit strength: 

However, under most circumstances there is a close approxima- 
tion to^a one-to-one correspondence, parallelism, or constancy be- 
tween S and s on the one hand and between r and R on the other. 
Accordingly, for purposes of coarse molar analysis S or s may be 
used interchangeably, as is the case with r and R, Since we shall 
be dealing with gross stimulus situations and the gross results of 
molar activity in the early stages of the present analysis, we shall 
usually employ the symbol, 

hSn. 

Later, when we reach a point requiring a more precise and detailed 
analysis, it will be necessary not only to employ the notation 

'sBff 

thus explicitly representing the neural impulse, but to distinguish 
through further subscript modifications various aspects of both the 
stimulus and the response situations. For example, S and s repre- 
sent S and s when considered as in the process of being conditioned, 
whereas the dots will never be used when S and s are considered 
as performing the function of response evocation. 

sABrr sraEiTGTH cojtcexved as a miTCTioisr op the htjmber 
OF EEillS'FORCEMEiq'TS 

Having decided to employ the construct sHr, we proceed at 
once to the problem of determining the presumptive quantitative 
nature of its functional relationship to its various antecedent deter- 
miners. The first of these to be considered will be the relationship 
of sjETjg to the number of reinforcements {N)\ This type of deter- 
mination pre^nts certain difficulties. 

Where the members of a functional relationship are both directly 
measurable, as in the Hovland reaction-amplitude study cited 
above, the procedure for determining the approximate mathematical 
relationship is fairly straightforvrard. A table of corresponding 
^pirhml values of the two variables is prepared and usually 
jdiMed cm graph paper, as in the circle sequence of Figure 21. 
From BM of these empirical results various equations 

tmjPTO to yidd curv^ r^embling the one shown in the graph are 
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fitted to the data. This consists in the main in caiculating various 
constants ^ called for by the respective equations. * Thus in the 
equation fitted to the Hovland data shown on page 120, the values 
of .141, .033, and 3,1 are all fitted, or empirical, constants; the 10 
is an arbitrary value chosen for convenience because it is the base 
of common logarithms. In the end that equation is accepted, along 
with the values of the various constants associated with it, which, 
when the several values of one of the two variables are substituted 
in it, yields the closest approximation to the corresponding values 
of the other. In Figure 21 this approximation is shown by the 
nearness of the circles to the curve which was generated by the 
equation (see terminal notes). 

When, on the other hand, one of the variables of a functional 
relationship under investigation is a logical construct and so is 
neither observable nor directly measurable, the situation is quite 
otherwise, and the procedure for determining the quantitative rela- 
tionship is necessarily indirect and more difficult. The procedure 
in this case is to a considerable extent trial and error in nature, 
though in a rather different sense than where both sets of values 
are directly measurable. However, not all is trial and error, since 
in both situations certain supplementary principles are usually 
available for tentative guidance. This is notably true in the case 
of the probability-of-reaction-evocation curve of learning (see 
Chapter XVIII, p. 326 ff.). 

The investigation of the functional relationship of habit strength 
to the number of reinforcements is so new that the greater part of 
the trial and error involved in its determination has yet to be 
performed, even though the present attempt is the second such trial 
to be made. Taking our point of departure from extensive obser- 
vations in the field of habit formation typified by the experiments 
which yielded Figures 21 to 24, and profiting by the outcome of 
the first such attempt (^, pp. 164-165), it is concluded that very 
probably: 

1. Habit strength is an increasing function of the number of rein- 
forcements. 

2. This function increa^ up to i^me sort of physklogit^ l imit be- 
yond which no more incr^se k jK^ble. 

rather elalx)rate example of the determioaticm of such coa^^ants in 
the fitting of a curve to empirical learning data may be found in Chapter 
XH, p, 200; another may be found in reference Bf pp. 103-108. At bottom, 
much of this is dependent in one way or another upon the use of simultaneous 
equations. 
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3. As habit strength approaches this physiological limit with con- 
tinued reinforcements the increment ( A resulting from each addi- 

tional reinforcement decreases progressively in magnitude. 

Now, there are numerous algebraic expressions which yield re- 
sults conforming to the above specifications. One of these, how- 
ever, has a rather special promise because it is known to approxi- 
mate closely a very large number of observable empirical relation- 
ships in all sorts of biological situations involving growth and 
decay. Indeed, Figures 21, 22, and 23 are all cases in point. The 
basic principle of the simple positive growth function (Figures 21 
and 23) is that the amount of growth resulting from each unit of 
growth opportunity will increase the amount of whatever is grow- 
ing by a constant fraction of the growth potentiality as yet un- 
realized, 

^ THEORETICAIj curve of habit-stuength growth 

EXEMPLIFIED 

The characteristics of the positive growth function may be ex- 
hibited by means of an example. From the foregoing it is evident 
that the rate of habit growth is dependent upon three factors or 
parameters: 

1. The phydological limit or maximum {M) 

2. The ordinal number (N) of the reinforcement producing a given 
increment to the habit strength (A b^b) 

3. The constant factor (F) according to which a portion ( A 

of the unrealized potentiality is transferred to the actual habit strengOi 
at a given reinforcement 

There must also be devised a unit in which to express habit 
strength. This is taken arbitrarily as 1 per cent of the physiologi- 
cal maximum (M) of habit strength attainable by a standard 
organism under optimal conditions. In order to make the name of 
the unit easy to remember, it will be called the hab,'^ a shortened 
form of the word habit Thus under the conditions stated above 
there would be lOQ habit units, or habs, between zero and the 

M 

physiological limit, i.e., one hab = , 

100 

We proc^i now wiHi our example. Suppose that the growth 
wnstoat (F) in a ^ven reinforcement situation is taken as 1/10^ 


^ ProiHmi^sed hSb, sm in, cab. 



sHs AND THE NUMBER OF REINFORCEMENTS 1 1 5 

One-tenth of the total possibility of learning (100 units) is 10 habs 
(1/10 of 100 = 10). The generation of 10 units of habit strength 
from a base, zero, leaves 100 — 10, or 90 units of growth yet pos- 
sible of realization. Consequently the habit increment resulting 
from the second reinforcement must be 1/10 ol 90, or 9; i.e., the 
second = 9 habs. Subtracting 9 from 90, we have left 81 

units of possible growth. One-tenth of 81 in turn yields our next 
AsHb of 8.1 habs; and so on. This process can be repeated as 
many times as there are successive repetitions of the reinforcement. 
Column 2 of Table 1 shows the first 30 successive AkH^s com- 
puted in this way. These AsH^s are shown graphically at the left 


TABLE 1 

Analytical Table Showing the Theoretical Evolution of a Typical 
“Growth” Function in Which Each Increment to the BLabit Is 1/10 op 
the Potential Habit Strength as yet Unformed. (See text for details and 
Figures 25 and 26 for graphical representation.) 


Ordinal Number of 
Reinforcements 

Increment of Habit 
(AsHb) 

Total Accumulated 
Habit in Hah Units 
(XAsITb) 

1 

10 

10 

2 

9 

19 

3 

8.1 

27.1 

4 

7.29 

34.39 

5 

6.561 

40.951 

6 

5.9049 

46.856 

7 

5.3144 

52.1703 

8 

4.7830 

56.9533 

9 

4.3047 

61.2580 

10 

3.8742 

65.1322 

11 

3.4868 

68.6189 

12 

3.1381 

71.7570 

13 

2.8243 

74.5813 

14 

2.5419 

77.1232 

15 

2.2877 

79.4109 

16 

2.0590 

81.4698 

17 

1.8530 

83.3228 

18 

1.6677 

1 84.9905 

19 

1.5009 

86.4915 

20 

1 1.3509 

87.8423 

21 

1.2158 

^.0581 

22 

1.0942 

90.1523 

23 

.9848 

91.1371 

24 

.8863 

92.0234 

25 

.7977 

92.^10 

26 

.7179 

93.5389 

27 

.6461 

94.1850 

2S 

,5815 

94.7665 

29 

.^33 

95.2^ 

30 

.4710 

95.7609 
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edge of Figure 25, piled one upon the other in the order in which 
they were derived. It is notable that these increments become 
smaller and smaller until, with very large values of N, they become 
infinitesimal. 

In column 3 of Table 1 are presented the cumulative values of 
column 2. These latter values, in their turn, are, represented graph- 
ically in the main portion of Figure 25. The contour of this 
columnar figure is a rather precise representation of what is here 
conceived to be the basic ^^curve.of learning’^ from which all other 
theoretical curves of learning are derived in one way or another. 



Peg. 25. Diagrammatic representation of. a theoretical simple positive 
growth function. At the left are given the successive increments of habit 
accretion for successive reinforcements as shown in column 2 of Table 1. At 
the right may be ^en the amount of accumulated habit strength at the 
succ^ve reinforcements as shown in column 3 of Table 1, 


It will be noticed that it rises at first with comparative rapidity, 
the raie of rme gradually diminishing until at high values of N it 
become practically horizontal. Because of their progressively 
dimimshing rate of rise such curves are said to be negatively accel- 
erated. 

Tbe notched appearance of the contour of the main portion 
of Fi^ixe 25 is due to the fact that each reinforcement is a unit, 
i.e,, m ^^ntiaBy indivisible into fractional parts such as halves 
or Hurds of a rdnforc^mmit. The usual method of plotting learning 
funetiom by anooth-line eurv^ running through the points repre- 
the rulings taken after each reinforcement, is shown for 
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the values in column 3 of Table 1 in Figure 26. It is to be noted, 
however, that while for many purposes this method of representing 
the course of learning is to be preferred because of its convenience, 
there is danger of its giving the uninitiated a false impression of 
smooth continuity. Such a smooth, continuous process could result 



Fig. 26. Theoretical curve of learning a simple conditioned reaction isHs) 
as a function of the number of reinforcements (iV), plotted in the customary 
manner, which implicitly but falsely assumes that repetitions can be sub- 
divided indefinitely (see text). 

only if the successive repetitions of reinforcement were to be indefi- 
nitely subdivided into fractional parts. The cyclical nature of the 
reinforcement process precludes this. 

SUMMARY 

The effect of reinforcement may become manifest in overt action 
upon the presentation of the associated stimulus at any time during 
the subsequent life of the organism. This central fact shows con- 
clusively that reinforcement leaves within the organism a relatively 
permanent connection between the receptor and the effector asso- 
ciated in the original reinforcement- It is this which in the present 
^stem is meant by the term ^'habit,” a technical adaptation of 
the common-sense concept that go^ by the same name. 

Sinc^ the organization of the nervous system upon which habit- 
ual action is evidently based lies deeply hidden and quite remote 
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from any immediate means of direct observation, habit has the 
status of an imobservable, i.e., it is a logical construct. As such 
it is prevented from becoming a metaphysical entity by being firmly 
anchored, both antecedently and consequently, to phenomena which 
alike are observable and measurable. On the antecedent or causal 
side, habit is known to be dependent upon various factors asso- 
ciated with reinforcement, but notably upon the number of rein- 
forcements. On the consequent or effect side, habit manifests itself 
in action, ideally on the presentation of the stimulus aggregate 
originally associated with it at its reinforcement. It is important 
to note, however, that betw'een sHr and observable response phe- 
nomena there intervene several additional symbolic constructs (see 
Figure 84), each of wmicli is directly or indirectly anchored both 
antecedently and consequently to quantitatively observable phe- 
nomena. The strength of the habit is manifested indirectly by 
various measurable aspects of action: (1) reaction amplitude or 
magnitude (^4), (2) reaction latency (stR), (3) resistance to ex- 
perimental extinction (n), and (4) probability (p) of occurrence, 
i.e., per cent of appropriate stimulations which evoke the associated 
reaction (p. 326 ff.). 

From a study of the empirical relationships of the number of 
reinforcements to t 3 rpical examples of each of the four forms of 
habit action it is concluded that while all are dependent in the 
main upon habit strength, each is also dependent in part, and dif- 
ferentially so, upon other factors which enter the reinforcement 
situation. For this reason it will be convenient to have a repre- 
sentation of habit strength independent of any of its potential 
behavioral manifestations. 

The determination of the functional relationship of an observ- 
able to an unobservable presents a rather different and more diffi- 
cult probl^ tiian that of two observables. The two determinations 
are alike in that they are both dependent partly upon a process 
of trial and error, though each in a somewhat different sense. In 
tile case of an observable and an imobservable the relationship 
which is most plausible in the light of all related observable phe- 
mmmm is poskdated. This postulated relationship is then em- 
pky^i in al appropriate deductive situations. If the assumption 
fe fal^ d^uctions will lead, sooner or later, to inconsistenci^ 
wiih ^d m tii ite correction. On the other hand, if 

a vsry number of such d^uctions unif oimly agree with obsei 
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vation, this will indicate that the postulated relationship is valid 
to an increasing degree of probability. 

The postulated relationship of habit strength to the number of 
reinforcements is that each reinforcement resulte in the addition 
of an increment to the habit strength ( AgHs) which is a constant 
fraction (F) of the difference between the physiological maximum 
(M) of habit strength and the habit strength immediately preced- 
ing the reinforcement. This is a relatively imcomplicated mathe- 
matical relationship which we shall call a simple positive growth 
junction, 

NOTES 


The Pi^ent Trial-and-Error Status of Our Hypothesis as to the Relation 
of Habit Strength to Number of Reinforcements 

On i^ge 1 13 it was pointed out that the determination of the correct functional 
relationship of an unobservable to an observable is to a considerable extent 
dependent upon feial and error. As a matter of fact, the formulation of this 
relationahip contained in the present chapter is ^ second such taial- The first 
such formulation made up a part of a poskiiate set upon which was based a highly 
formalized theoiy of rote learning (S). The assumption in that ca^ was that 
habit strength is directly proportional to the number of reinforcements up to the 
physiological limit. That postulate ^nerated a theorem which is clearly con- 
trary to fact (Sy pp. 164-165). The present formulation corrects the defect thus 
revealed and so is presumably a closer approximation to the truth than was the 
first attempt; therefore it may be expected to survive somewhat longer. 

How to Compute Habit Strength 

The rather chumy method of generating the theoretical learning curve used 
above for illustrative purpose (Table 1 and Figure 25 and 26) was chc^n for 
expository reasons because of its j^chological simplicity. For systematic pur- 
the outcome of this arithmetical prcxjedure as shown in column 3 of 
Table 1 is usually repre^nted by an equation. It may be shown by rather sim- 
ple mathematical procedures that column 3 (ZAsHs) as a function of the num- 
ber of repetitions (N) is given by the equation: 

sHs = M - ( 1 ) 

where M = 100, N is the number of reinforcement repetitions, e is 10, and 

( 2 ) 

where F is the reduction constant, in the above example taken as 1/10, i.e., 
F = .1. Accordingly, 

i = log ^ j j = Ic^-i = logl.lllllH 

Now, by ordinaiy Ic^arithm tables, 

log 1.1111U+ “ -04574 {approximately). 
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Therefore, if we wish to determine the amount of sHn after five reinforcements 
we have, 

SjT _ lOQ 

Stin — 100 20.04574 X 5 

■ 100 100 

■ 10-2287 


: 100 - 


100 

1.6932 


= 100 - 59.0597 


= 40.94, 


which, except for the facts that logarithm tables give only approximate values 
and that decimals have been dropped, would agree exactly with the corresponding 
values in Table 1. 


How to Compute Increment of Habit Strength per Single Reinforcement 

In a s imil ar manner the value of the increment in habit strength due to one 
reinforcement (AsHr) at any particular stage of the learning is given by the 
equation, 

AsHb == Jkf — a? — {M — x) 10~* (3) 

where x is the strength of the habit immediately preceding the reinforcement 
which produces the increment, and M, etc., have the same values as above. 
For example, the value of which would result from a sixth reinforcement 
following the fifth reinforcement (the sHr of which was calculated above) is 
calculated as follows: substituting in equation 3, we have, 

ArUr = 100 ~ 40,94 ~ (100 - 40 . 94 ) 10~-04574 
= 59.06 ~ 


= 59.06 - 


59.06 

1.1111111 


= 59.09 - 53.154 


= 5.906. 


The value 5.9C^ is as close an approximation to the value in column 2 of Table 1, 
wl^re = 6, as is to be expected from the approximations attainable where 
cadinajy lo^nthm tabl^ are employed in the computations. 


Equations Fitted to the Learning Curves 

We boom now to the details of the analyst of the data, of Figures 21, 22, and 
Tbe Moviand data r^fareseoLted in Figure 21 are fitted fairly well by the equation, 
A = 14.1(1 - lO- esaiT) 4. 3,1^ (4) 

whkh the <^irve runmng throu^ the data points of Figure 21 been 
l)ftotted- Tins k a positive growth function. 
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The Simley data, reprinted in Figure 22, are fitted fairly well by the equar- 
tion, 

^ 

[100(1 - 10--i22.v)j-4/ w 

from which the curve running through the data points of Figure 22 was plotted, 
where ds represent the time (i) from the banning of S to the beginning of 12, 
i.e., the reaction latency. The equation reprints the reciprocal of a slightly 
complicated positive growth function. Since the denominator of the right-hand 
member of the equation becomes zero when N = 0, becomes infinite under 
the same conditions, i.e., no reaction whatever occurs when there is no habit 
strength. In point of fact, no reaction occurs when the effective reaction potential 
is less than the reaction threshold (see Chapter XX). 

ITie Williams data reprinted in Figure 23 are fairly well expre^ed by the 
equation, 

ft = 66(1 - 10--018 ^ - 4, (6) 

from which the curve drawn through the data points of Figure 23 was plotted, 
where n represents the number of unreinforced reactions performed before a 
given degr^ of experunental extinctaon develoj®. 


The Relation of the Equation Expr^ng Habit Strength (1) to 
Equations 4, 5, and 6 

It is implicit in the forgoing analysis that equations 4, 5, and 6, and the data 
represented in Figure 24, are composites, one component of which in each case 
takes the general form of equation 1. For example, equation 4, 

.4 = 14.1(1 - 10--G33*’^’) -f 3.1, 

when thus analyzed breaks up into the following: 

sHb = 100 - 100 X 10^ ^ ^ (7) 

and 

A = .141 XsHb + 3.1. • (8) 

Of th^, equation 7 expre^es habit strength (sUr) as a function of the numba* 
of reinforcements, and equation 8 expresses the amplitude of the reaction as a 
function of sHs, the latter relationship being linear (see Figure 87). 

The equation from the analysis of sSb as a function of N which emerges from 
equation 5 (Figure 22) is, 

- 100 - 100 X 10--3S2 (9) 

and that emerging from the analysis of equation 6 (Figure 23) is, 

sHr - 100 - 100 X 10-* oi8 (10) 

It will be observed that the general form of equations 7, 9, and 10 is the same, 
thou^ there is a wide variation in the coefficient of N, 

Tlie various equations corresponding to equation 8 represent the joint relar 
tionship of the final effector proc^ to (1) the results of previous learning retained 
in tibe nervcHis ^tem and (2) the particular stimulus conditions existent at the 
time of action evocation. Since numerous factors other than the mere sHr 
alter into the ultimate evocation process, the various functional rdationships 
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corresponding to equation 8 cannot properly be taken up until these factors have 
been examined. Accordingly, the final analysis of Figures 21, 22, 23, and 24 
will be delayed until Chapter XVIII. 

Does sHb Qiialify as a Quantitative Scientific Construct? 

Even though it be granted that the nature and quantitative aspects of habit 
action are immediately and necessarily dependent upon the state of the nervous 
system, it does not follow that this state, as represented by sHr^ qualifies as a 
satisfactoiy scientific construct. As already pointed out in the text, a typical 
scientific construct represents the joint, unitary action of a number of independent 
directly measurable variables in the determination of some subsequent event. 
If these iKtricMes do not act as a unit in a given situation they cannot properly he 
Seated as a quaniitaiwe construct in that situation. 

For example, there are excellent empirical grounds for believing that habit 
strength is dependent not only upon the number of reinforcements, but upon a 
number of other measurable antecedent conditions imder which reinforcement 
occurs. These antecedent factors play the rdle of independent variables. When 
finally worked out, the quantitative value of is to be thought of as a de- 
pendent variable whose value may be calculated by substituting in a more or 
less complex equation or formula the values of numerous independent variables 
such as the number of reinforcements, the magnitude of the reinforcing agent 
employed, the time from the onset of the conditioned stimulus to the reaction, 
the time from the reaction to the reinforcement, the number and nature of the 
irrdevant stimuli present during the reinforcement, and so on. AH of these- 
independent variables may be assumed to be related to habit strength in different 
ways, some favoring it and others hindering it in varying degrees. Now, in such 
a situation it is evident that an increase or decrease in the value of one of these 
independent variables may exactly offset a certain amount of decrease or increase 
in tile v£due of any of the others. This means that a given amount of habit 
may be produced indifferently by innumerable combinations of the antecedent 
varialfie values. It means, further, that all of these different combinations of ante- 
cedent reinforcement vancMes wiU yidd^ other f odors equal, in the action evocation 
situation ecacUy the same amplitude of reaction, latency of reaction, persistence of 
rmctwn, and probcMiiy of reaction. Fortunately these implications of the 
unitary natiire of a tamly scientific construct are capable of fairly straightforward 
enquiical tes^ 

It is evid^t that whether rHr, or indeed any other behavioral construct, 
win ^yfefy the requirements just outiined depends upon the outcome of a great 
axQOimt of precise quantitative experimentation, only a little of which has yet 
be^ performed, lliis uncertainty applies not only to the constructs employed 
in present sjpstem but to those of all other theoretical approaches, including 
potential ^sterns of the various Gestalt schools. If the ultimate verdict 
to be afifemative, the task of the behavior sciences will be far simpler 
timn ff it tiiims emt to be negative. Meanwhile we must resolutely face the reali- 
fes of whatev^ the future holds for us. It is believed that the 

and mo^ economical way to di«jover whether or not behavior is so 
CQU^^teted as to permit tiie nse of gemnne scientific constructs is boldly to postu- 
la^ tiwe implbafoi^ of thfe as^imption in all possible rituations, and 

or tiie hypothesis as the deductions a^ee or dis^ree witii 
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empirical findings. The beginning of such an attempt is being made in the 
pr^nt work. 
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CHAPTER IX 


Habit Strength as a Function of the Nature and 
Amount of the Reinforcing Agent 

We have seen that if a habit is to be set up, the act involved 
must be associated either with a need reduction or with some 
stimulus which has itself been associated with a need reduction. 
Now, the amount or quality of the reinforcing agent at every rein- 
forcement may clearly vary in such a way that the degree of need 
reduction will range from very large amounts through very small 
amounts down to a value of zero, at which point presumably no 
reinforcement at all will occur. In short, the amount (or quality) 
of the agent employed at each reinforcement appears as the second 
of the numerous antecedent conditions determining habit strength. 

It is evident even from the preceding analysis that somewhere 
between the extremes of zero need reduction and a maximum value 
of such reduction a transition must be made from zero amount of 
the reinforcing agent to an amount of considerable magnitude. The 
question arises: Is this transition abrupt, or is it gradual and pro- 
gressive? And if it is progressive, what is the law of its progress? 
A small amount of experimental evidence bearing on this question 
is available, some of it concerned with conditioned-reflex learning 
and some with selective learning. 

THE lUMIT OF COOT)ITIONED-EEFLEX LEAENING AS A FUNCTION 
OF THE AMOUNT OF THE REINPOKaNG AGENT 

Gantfc (2) has reported a conditioned-reflex experiment of the 
Pavlovian t3rpe in which each of several animals was conditioned 
to four different stimuli, one stimulus being reinforced by one-half 
gram of food, one by one gram, one by two grams, and one by 
twelve grams. The four conditioned reactions were reinforced in 
random order not only on different days but during the experimen- 
tal se^on on the same day. After considerable amounts of train- 
ing it was found that some dogs developed clearly differentiated 
reactions to the several stimuli, though others were unable to do 
this. Ihe m^n results from one of the former, named “Billy,” 
a very stable animal, are shown by the circles in Figure 27. The 
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curve running through these values is a simple positive growth 
function originally fitted to them by Dr. Gantt (^). The close- 
ness of the approximation of the fitted to the empirical values indi- 
cates considerable consistency despite the small number of points 
involved. 

Notwithstanding the resemblance of this curve to those char- 
acteristic of ordinary learning, it is not to be confused with a 



Fig. 27. Graphic representation of the empirical functional relationship 
between the amount of the reinforcing agent (food) employed at each rein- 
forcement of four conditioned reactions to as many different stimuli, and the 
final mean amount of salivan’ secretion evoked by each stimulus at the 
limit of training. The appreciable secretionai value of 75 units when the 
fitted cun’e is extrapolated to where the amoimt of reinforcing agent equals 
zero is presumably due to secretion evoked by static stimuli arising from the 
experimental enviix>nment. Plotted from unpublished data from the dog 
kindly furnished by Gantt (f) and here publidied with his permis- 
sion. The experimental work upon which this graph is based was performed 
previous to 19^ (personal communication from Dr. Gantt). 

learning curve; on the contrary, each of the data points of this 
investigation represents the mean response on the part of the dog 
at the limit of training to the respective amounts of reinforce- 
ment; i.e., each represents the final horizontal portion or asymptote 
of a separate and distinct curve of learning, 

THB RATE OF SELECTIVE liEAENING AS A FUNCTION OF THE 
AMOUNT OF THE REINiGRCING AGENT 

Grindley (^} has reported a study which is closely analogous 
to that of Gantt but w’Mch involves selective rather than condi- 



126 


PRINCIPLES OF BEHAVIOR 


tioned-reflex learning. This investigator trained five groups of 
twelve-day-old chicks to traverse a runway eight inches wide and 
four feet long. The reinforcing agent or reward was grains of 
boiled rice placed in a shallow tray at the end of the runway. One 
group of chicks found and ate one grain of rice on reaching tJie 
tray after successfully traversing the runway; one group received 
two grains; another, four grains; and a fourth, six grains. A fifth 
group received no food whatever on reaching the tray. The score 


adopted as an index of learning was i.e., 100 times the recipro- 


cal of the time in seconds required by a chick to traverse the 
runway and begin eating the rice. 

Grindley published composite learning curves for his several 
groups of chicks. Pooled measurements of the comparable later 



Fig. Empirical graph representing" the rate of selective learning as a 
function of the magnitude of the reinforcing reward. Each circle represents 
the mean score at the la^ five of seven trials of ten chicks in traversing a 
four-foot runway to ^cure differing numbers of boiled-rice grains. (Derived 
frtm measurements made of learning curves published by Grindley, $, p, 174.) 


fKniatms of th^ B&veral curv^ are shown in Figure 28. In order 
to permit fnrtib^ comparison with the Gantt study, a simple posi- 
tive fune^on has be^ fitted to the Grindley data; this is 
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represented by the cur\^e passing among the circles of Figure 28. 
While the deviations of the data points from the curve are con- 
siderable, the fit is still close enough to indicate a fair approxima- 
tion. 

Grindley’s results are corroborated and extended by a study 
reported by Wolfe and Kaplon (4). These investigators repeated 
the substance of Grindley's experiment, utilizing with groups of 
chicks an explicit trial-and-error task, that of learning a simple 
T-maze. A comparison was made between the reinforcing power 
of one-fourth of a kernel of popcorn, a whole kernel, and four 
quarters of a kernel given at once. It was found that a whole 
kernel was more reinforcing than a quarter of a kernel, w^hich 
confirms the findings of both Gantt and Grindley. However, four 
quarter kernels given at once proved to be distinctly more rein- 
forcing than did a single intact kernel. In the case of the four 
quarter kernels, of course, the chickens pecked and swallow^ four 
times after each succ^sful act. We thus appear to Lave in this 
ca^ the paradox of four more or less distinct reinforcements con- 
tributing by summation to produce the increment of learning re- 
sulting from a single successful action sequence. 

In spite of the great difference in the organisms involved, in 
the index of learning employed, and in the stage of learning repre- 
sented, the results of the selective learning experiments show a strik- 
ing general agreement with the conditioned-reflex results of Gantt. 
It accordingly seems fairly clear from the two types of studies that 
the rate of learning is an increasing monotonic function of the 
amount of the agent employed at each reinforcement. 

HOW I>OES THE AMOUNT OF THE REINFORCING AGENT INFLUENCE 
THE TWO PARAMETERS OF THE CURVE OF HABIT FORMATION? 

We saw above (Chapter \TII) that the curve of habit strength 
as a function of the number of reinforcements was dependent upon 
tw’o constants, (1) the physiological maximum of habit strength 
(M), and (2) the fractional part (F) of the as yet unrealized 
potentiality of habit-strength acquisition, vrhich is added to the 
actual habit strength at each reinforcement. Assuming the sound- 
ness of this hypothesis, it is evident that the influence of increas- 
ing the amount of the reinforcing agent on the size of the increment 
of habit strength (AsHr) at a given reinforcement must r^ult 
from an increase in one, and possibly both, of these parameters. 
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The parameter, or parameters, involved could be determined if 
we had reliable learning curves which were carried up to, or near, 
the limit of practice under different amounts per reinforcement of 
the same reinforcing agent. Unfortunately no such investigation 
has yet been reported in detail, though there are indications in 
both the study of Gantt and in that of Wolfe and Kaplon that, 
at the limit of practice, habit strength is definitely greater where 
the amount of the agent employed in reinforcement is greater. 
Gantt has published no practice curves, and those published by 
Wolfe and Kaplon are not sufficiently regular to make profitable 
a precise analysis from this point of view. An inspection of these 
latter curves and the tables accompanying them suggests, however, 
that in habit formation the F- value may be approximately constant 
for different amounts of the reinforcing agent or reward. The only 
possible remaining parameter which could produce the slower learn- 
ing with small amounts of the reinforcing agent is the asymptote 
or upper limit of the learning curve. 

In this connection it will be recalled from the last chapter that 
the physiological limit of habit strength under absolutely optimal 
conditions was taken as 100 habs, p* 114. To this value was 
assigned the S3Tiibol M. The introduction of the presumption that 
the asymptotes of learning curves may vary below this level, de- 
pending on the amount and quality of the reinforcing agent, makes 
it necessary to employ a separate symbol [M') to represent such 
limits or asymptotes. We now state the working hypothesis at 
which we have arrived: In a learning situation which is optimal 
in all other respects, the limit (Jkf') o/ habit strength {sHr) attain- 
able with unlimited number of reinforcements is a positive growth 
function of the magnitude of the agent employed in the reinforce- 
ment process. This tentative conclusion is based on admittedly 
inadequate grounds and will therefore be subject to reexamination 
and revision when more satisfactory evidence becomes available. 

An important extension of this hypothesis at once suggests 
it^f. Clearly a reinforcing agent may vary in quality as’ well as 
in quantity. More specifically, with quantity remaining constant, 
a reinforcement by one agent may reduce the need more than a 
reinfomement by another. For example, a standard food may be 
adulterate! by adding varying amoxmts of some inert and tasteless 
such as a flour consisting of ground wood. It is evident 
that two grams of a standard dog ration mixed with 50 per cent 
wo(Ki flour would reduce an animaFs food need only half as much 
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as would the same quality of the food unadulterated. While bo 
report of such an experiment has been found, it seems reasonable 
to suppose on the analogy of the experiments of Gantt and Grind- 
ley that a food so adulterated would be less effective as a ^^pri- 
marj’-'' reinforcing agent than would an equal weight of the natural 
food. However, Judging from the shape of the curve of Figures 27 
and 28 it is to be expected that the limit of learning (m) resulting 
from reinforcement with the adulterated agent would be more than 
half as effective as would be that resulting from the use of the 
unadulterated agent. 

We conclude, then, that habit strength at the limit of 'practice 
(m) will vary with the quality, as well as the quantity, of the rein- 
forcing agent from a minimum of zero to a physiological maximum 
of 100 hubs, and that the rate of approach to that limit (F) will 
remain unchanged, 

SOME IMPUCATIOKS OP THE AM0U2TK)F-REINF0RCEME3SrT 
HYPOTHESIS 

The meaning of the working hypothesis just formulated may be 
clarified by indicating one or two of its implications. Let it be 
supposed, on the analog}’ of the Gantt experiment, that the maxi- 
mum amount of a given food when used as a reinforcing agent would 
yield at the limit of practice a habit strength of 80 habs. Calcula- 
tions based on the F-constant fitted to the Gantt data show that if, 
under these conditions, one gram of this food were used at each 
reinforcement, the maximum habit strength to be expected at the 
limit of practice would be 23.75 habs. A parallel computation 
shows that the maximum to be expected from the use of six grams 
of this food at each reinforcement w’ould be 70.14 habs. 

With these maxima available it is possible to calculate the theo- 
retical course of habit-strength acquisition under the respective 
conditions by substituting first one of the values for m in the simple 
positive grov’th function, and then the other, letting the fractional 
incremental factor, F, equal 1/10, as in the illustration of Chapter 
VIII. In this way were computed the values from which were 
plotted the two main curves of Figure 29. These curves, taken 
together, show the effect upon the course of habit formation implied, 
by the hypothesis put forward above. 

An additional implication of the working hypothesis may still 
further clarify its meaning. Let it be supposed that the habit 
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has been reinforced with one gram of the food fifteen times and 
that the reinforcement is then suddenly shifted to six grams on the 
next fifteen reinforcements. Neglecting the presumptive persever- 
ating influence of secondary reinforcement in the situation, the out- 
come is easily calculated by methods analogous to the determina- 
tion of the two main curves of Figure 29. This is shown by the 
dotted curve rising from the one-gram curve at the fifteenth rein- 



Feg. 29. Graphic representation of the theoretical course of habit-strength 
acquisition with a six-gram food reinforcement (broken line) and with a one- 
gram food reinforcement (solid line). The dotted curve indicates the theo- 
retical course of habit-strength acquisition on the assumption that rein- 
forcement is abruptly shifted to six grams on the sixteenth trial of the 
one-gram reinforcement curve, 

forcement. A glance at Figure 29 shows that according to the 
present ht^thesis an increase in the amount of the reinforcing 
agent should be followed by a marked increase in the rate of habit- 
strength acquisition; this, however, would gradually lessen as the 
limit is approached, in accordance with the nature of the 
^mple growth function. In concluding our discussion of Figure ^ 
it Is to be ftat exactly analogous curves would also be pro- 
duced by suitable variations in the quality of the reinforcing agent. 
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Finally, an additional case may be mentioned — ^that in which 
the six-gram habit would be reinforced ten times. This would 
generate a habit strength of 45.68 habs, which is considerably above 
23.75 habs, the maximum attainable by means of a one-gram rein- 
forcement. Then the reward would suddenly be shifted to one 
gram. Assuming these conditions, and ignoring secondary reinforc- 
ing effects, it is to be expected that successive reinforcements would 
result in a progressive weakening of the habit. Consideration of 
this interesting but complex problem must be deferred imtil the 
phenomena of experimental extinction have been taken up in detail 
(p. 258ff.). 

THE AMOUNT OF THE EEINFOBaNG AGENT AND THE PROBLEM 

OF INCENTIVE 

Although the systematic pr^entation of the subject of moti- 
vation has b^n r^awed for a later chapter (p. 226 ff.), it becomes- 
desirable here to touch briefly on one of its phases. Motivation has 
two aspects, (1) that of drive (B, or Sb) characteristic of primary 
needs, and (2) that of incentive. The amount-of-reinforcement 
hypothesis is closely related to the second of these aspects. The 
concept of incentive in behavior theorv" corresponds roughly to the 
common-sense notion of reward. !More technically, the incentive 
is that substance or commodity in the environment tvliich satisfies 
a need, i.e., which reduces a drive. 

Let us suppose that in a simple selective learning situation in- 
volving a hunger drive, the food employed as a reinforcing agent 
is plainly visible at the moment the organism performs the several 
acts originally evoked by the stimulus situation; that the indi- 
vidual reinforcements are separated by a number of hours; and 
that the amount of food employed in the several reinforcements 
varies at random from almost zero to very large amounts. Under 
such circumstances it follows from the principle of reinforcement 
that the visual stimulus arising from the food will be conditioned 
to the successful act by the subsequent reinforcing state of affairs, 
the consumption and absorption of the food. Moreover, in case the 
amount of food shrinks to zero there will be no direct reinforce- 
ment at all. From these considerations, coupled with the amount- 
of-reinforcement h3rpothesis, it may be inferred that the successful 
reaction will be more strongly conditioned to the stimulus aggregate 
arising from a large piece of food than to that from a small one. 
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Therefore, given a normal hunger drive, the organism will execute 
the correct one of several acts originally evoked by the situation 
more promptly, more vigorously, more certainly, and more persist- 
ently when a large amount of food is stimulating its receptors than 
when they are stimulated by a small amount. 

Substantial confirmation of this deduction is furnished by a 
recent experiment reported by Fletcher (f), who trained a chim- 
panzee to secure pieces of banana by pulling into its cage a weighted 
car by means of an attached rope. It was found that the animal 
would perform more work for a large piece of banana than for a 
small one. The maximum amount of work which would be per- 
formed for a given amount of banana incentive between the range 
of .64 and 3.77 units was practically linear. 

An extrapolation of Fletcher’s linear relationship just men- 
tioned suggests that the animals would have performed with a zero 
amount of incentive more than half as much work as was performed 
'with the incentive at 3.77 units. This might well occur on a few 
occasions due to the conditioning of the reaction to the stimuli 
arising from the apparatus and other environmental elements. On 
the other hand, in case no food is present the strength of the re- 
maining stimulus elements conditioned to the reaction may be so 
far depleted by the absence of the incentive component of the 
stimulus compound (see Chapter XV) that the effective habit 
strength will either be less than the reaction threshold or than 
the strength of some competing reaction tendency evoked by the 
stimulus situation; in either case the reinforced reaction may not 
occur at all. In the event that it does occur under such conditions, 
however, experimental extinction will presently set in and soon 
terminate it. 


SUMMARY 

Since the amount of need reduction presumably varies with the 
amount of the reinforcing agent consumed by the organism, it fol- 
lows as a strong probability from the dependence of reinforcement 
upon the amount of need reduction that the increment of habit 
sia^ength (AsHr) per reinforcement will be an increasing function 
of the amount of the reinforcing agent employed. This a 'priori 
expectation is substantiated by empirical investigations involving 
lx>th elective and conditioned-reflex learning. Moreover, these 
studio indicate iJiat the relationship is that of a simple positive 
growth fimction. 
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Because the law of simple habit acquisition is presumably a 
positive growth function of the number of reinforcements, it fol- 
lows that the increased rate of habit acquisition with increased 
amounts of the reinforcing agent employed may be due to one or 
both of two factors: (1) an increased limit of potential habit- 
strength growth, or (2) an iucreased fraction of this potentiality 
which is added to the habit at each reinforcement. The empirical 
evidence on this point is at present inadequate for final decision. 
Pending the appearance of more complete evidence, the working 
hypothesis is adopted that an increase in either the quality or the 
quantity of a reinforcing agent increases the rate of learning by 
raising the limit (m) to which the curve of habit strength ap- 
proaches as an asymptote, the rate of approach (F) to this limit 
pcssibly remaining constant for all qualities and amounts of the 
reinforcing agent employed. 

From the amount-of-reinforcement hypothesis may be derived 
a special ease of one phase of motivation, that of incentive or sec- 
ondary motivation. Ihis is the situation where the incentive (rein- 
forcing agent) contributes a prominent, direct component of the 
stimulus complex which is conditioned to the act being reinforced. 
The stimulus component arising from a large amount of this 
substance will be different from that arising from a small 
amount, and will differ still more from a stimulus situation con- 
taining a zero amount. It follows from this and the amount-of- 
reinforcement hypothesis that in the course of reinforcement by 
differing amounts of the reinforcing agent, the organism will in- 
e't’itably build up stronger reaction tendenci® to the stimulus aris- 
ing from large amounts than to that from small amounts, and 
no habit strength at all will be generated by zero amounts. It 
thus comes about, primary motivation (e.g., hunger) remaining 
constant, that large amounts of the agent will evoke more rapid, 
more vigorous, more persistent, and more certain reactions than 
will small or zero amounts. Thus a reinforcing agent as a stimulus 
becomes an incentive to action, and large amounts of the agent 
become more of an incentive than small amounts. This a priori 
expectation is well substantiated by quantitative experiment as well 
as by general observation. 
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NOTES 

The Equations Which Were Fitted to Gantt's and Grindley's Empirical 

Data 

The equation fitted by Gantt {2) to the data represented by the circles shown 
in Figure 27 is, 

A ^ 235 a- 10--153 v>y + 75 ^ 

where A is the amount of salivary secretion in arbitrary units during a constant 
number of seconds, and w is the weight of the reinforcing agent in grams employed 
at each trial. The + 75 is arrived at by extrapolation. 

The equation fitted to the Grindley data represented by the circles of Figure 28 
is, 

^ = 21.38 (1 - 10--362 -0 + .5, 

where t is the time in seconds required to traverse the four-foot runway to the 
food and n (number of rice grains) is the magnitude of the reinforcing agent 
employed. The + .5 represenls the score resulting from spontaneous exploratory 
activity previous to receiving reinforcement at the end of the runway. 


The Equations From Which the Curves of Figure 29 Were Derived 


The equation from which the upper curve of Figure 29 was derived is, 
sHr = 70.14(1 - 

in which the 70.14 was derived from the equation, 

M' = 80(1 - 10-*i53 (12) 

where w = 6 grams. 

The equation from which the lower curve of Figure 29 was derived is, 
sHr = 23.75(1 - 10-04576 


in which the 23.75 was derived from the equation, 
M' =80(1 - 10“-i53«), 

where to = 1 gram. 


Th^ equation from which the dotted curve rising from the 1-gram curve was 
(teiv^is. 


= (70.14 - 18.86)(1 - 10~-0457«^ + 18.86. 
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CHAPTER X 


Habit Strength and the Time Interval Separating 
Reaction jBrom Reinforcement 

We have seen in the last two chapters that habit strength is 
dependent upon two measurable aspects of reinforcement: (1) the 
number of reinforcements and (2) the intensity (amount and 
quality) of the reinforcing agent. In the present chapter we shall 
consider the functional relationship of habit strength to a third 
measurable aspect of reinforcement — ^that of the time interval sepa- 
rating the reaction bemg conditioned from the reinforcing state of 
affairs (p. 80). 

OBIGIN AND FBACnONATION OF THE PBOBUSM 

The notion that the habit strength resulting from the conjime- 
tion of a receptor and an effector process is a function. of the tem- 
poral nearness of a reinforcing state of affairs has long been cur- 
rent. It seems first to have been formulated by E. L. Thorndike 
in 1913, on the basis of general observation as an aspect of his 
“law of effect” {11, p. 173). Substantially the same idea was put 
forward, apparently independently, by Margaret Washburn in 1926 
m, p- 335) . Thorndike expresses no opinion concerning the logical 
status of the principle. Washburn, however, clearly regarded it as 
a secondary, rather than a primary, principle. She believed it 
could be derived from the associative “law of recency,” a supposed 
basic principle of learning once much in vogue but, in the light of 
recent work, now regarded primarily as a function of perseverative 
stimulus traces (p. 71). 

Thorndike’s original hypothesis breaks up into a number of 
distinguishable aspects which present convenient points of de- 
parture for a systematic consideration of the subject: 

1. Is there, in fact, a functional dependence of habit strength upon 
the time interval separating the receptor-effector conjunction gCj. from the 
reinforcing state of affairs? 

2. Assuming such a functional dependence, what is the direction of 
slope of the gradient? 
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3. How far does this gradient extend from the point of reinforce- 
ment before falling approximately to zero? 

4. What is the shape of this gradient; i.e., what are its mathematical 
characteristics? 

5. What parameters of the curve of learning are affected by the 
eradient of reinforcement— the rate of rise (F), or the limit of rise (m), or 
both? 

6. Is the relationship a primary principle or is it a secondary one; 
i.e., can it be derived from other and more basic principles? 

7. WTiat other behavior processes, if any, are derivable as secondary 
phenomena from this principle? 

EARLY DIEEC?r EXPERIMENTAL ATTACKS ON THE PROBLEM 

A great deal of experimental effort has been devoted to the 
solution of one or another of the subsidiary problems growing out 
of Thorndike’s original hypothesis. The first of these studies, pub- 
lished in 1917, was by John B. Watson {15), The apparatus of 
this experiment consisted essentially of a food chamber surrounded 
by sawdust to a depth of four inches. The task of the subjects, 
twelve himgry albino rats, was to dig through this sawdust, find 
a round hole giving access to the food chamber, and secure food 
which was in a shallow cup covered by a lid. Perforations in the 
lid allowed free passage of food odors. As preliminary training, 
the animals Eved for a time in the food chamber, where they appar- 
ently ate freely from the food cup. 

When the digging tests began the rats were divided into two 
groups, the animals of one group being allowed to eat as soon as 
they reached the food cup; but when the animals of the second 
group reached the food cup the perforated cover was held in place 
for 30 seconds before eating was permitted. In the course of 27 
trials both groups of a nim als gradually reduced the time of reach- 
ing the fo<Ki cup from around 100 seconds to six or seven seconds, 
thou^ there were no indications of any special advantage in rate 
of learning of either group over the other. 

The next experiment to throw much light on the problem was 
reported in 1929 by Mrs. Hamilton (nee Haas) (J). She employed 
albino rate in a Warden compound Y-maze involving five succes- 
sive choice, each of one correct or one incorrect turn. Between 
pke choice point of the maze and the food box a retention 
chamter placed where the animals could be held as long as 
before bdng permitted to entex the food box and eat. Five 
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groups of approximately 20 animals each were used, the delays in 
the retention chamber employed with the respective groups being 
Oj 1, 3; 5, and 7 minutes. 

A clearly marked difference in learning rate was found between 
the group permitted to eat at once and the various delay groups, 
but there was little indication of a consistent advantage in the 
shorter delay groups over those subjected to longer delays. All 
of the animals learned, the several delay groups requiring roughly 
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MINUTES OF DOAY IN RBNFORCEMENT 

Fig. 30. Empirical delay-of-reinforcement gradient. The circles represent 
amount of learning for constant numbers of reinforcements as a function of 
the delay in the occurrence of the reinforcement. These values have been 
calculated from data published by Wolfe U6). 


twice as many trials to reach a given criterion of learning as did 
the no-delay group. The Hamilton study, in contrast to that of 
Watson and an intervening experiment by Warden and Haas (13) ^ 
accordingly indicates that (1) there is in fact a gradient of rein- 
forcement, (2j the gradient slopes do^^mward from immediate rein- 
forcement as the length of delay increases (both quite as Thorn- 
dike and Washburn supposed), and (3) there is a sugg^ion that 
the gradient change its nature in some important sense when de- 
lays of reinforcement exceed one minute. There is also an indiea- 
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tion that the employment of a separate chamber for the restraint 
of the animal before it enters the food box is somehow critical in 
bringing out the influence of the period of delay on learning 
rate. 

The next experimental investigation of the problem which de- 
mands our consideration was performed by Wolfe (16). The more 
significant portion of the Wolfe study concerns the learning of a 
single unit of a simple T-maze by eight groups, each of eight 
albino rats, with the following food delays for the respective 
groups: 

0" 5" SO" r 2.5' 5' 10' 20'. 

Wolfe employed special delay chambers, one for the final correct 
choice and one for the final incorrect choice, both being distinct 

from the food chambers. In- 
dices of habit strength calcu- 
lated from Wolfe^s published 
tables yield the values shown 
graphically in Figure 30. 
There it may be seen that the 
gradient falls sharply from 
delays of zero to those of one 
minute, after which the slope, 
though upon the whole con- 
tinuous, is much more grad- 
ual. This study is in general 
agreement with Hamilton's 
findings in showing a critical 
change at a delay of around 
one minute; it also fills in the 
important gap lying between 
zero and 60 seconds by sup- 
plying evidence of the rate 
of leaifeing at delays in re- 
inforcement of five and 30 
seconds r^pectively. The values in this region when carefully 
exmnin^ turn out to approximate very closely a negative growlh 
function, as shown in Figure 31. This gives us the first convincing 
cine concerning Ihe answer to our third and fourth questions f ormu- 
1^^ abova 


50 40 ^ 20 10 

OF oeuY m rontorcemoit (t ) 

Feg. 31. The first four of the Wolfe 
deiay-of-reinforcemeiit values, together 
with the negative growth or decay func- 
tion which fits them rather well. This 
curve represents a decrease of between 
1/15 and 1/16 at each additional second 
of delay in reinforcement. 
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perin’s expeehment 

The most recent experiment m this field is reported by Perin 
{10). In that portion of his investigation which especially con- 
cerns us here, Perin employed a modified form of the Skinner box 
(see p. 87 above). Through a horizontal slot in a metal plate 
on one wall of the experimental chamber there projected an easily 



Fig. 32. Graphic representation of the gradient of reinforcement as in-, 
dicated by the slopes (tangents) of the composite learning curves of five of 
Perm’s group® of animals at the point where 50 per cent of the trials were 
without “error.” The slope of each group is represented by a circle; the 
curved line running through the circles is a special form of negative growth 
function which was fitted to these values. (Reproduced from Perin, 10.) 


moved brass rod. During the habituation period the apparatus 
was so set that a movement of this rod a few millimeters to either 
the right or the left would immediately deliver a pellet of food to 
ihe food cup beneath. After the animal had learned to perform 
both acts with some facility and his preference for one of them 
had been determined, the setting of the apparatus was so changed 
that (1) a movement of the rod in the preferred direction gave no 
food, and (2) a movement of the rod in the non-preferred direction 
caus^ (a) the rod instantly and silently to be withdrawn and 
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(b) the delivery of a pellet of food after time intervals varying 
according to the group involved, as follows: 

0" 2" 5" 10" 20" 30". 

As a rule the animals sat quietly by the food cup after the with- 
drawal of the bar, and simply waited for the food to be delivered. 
It is not without significance that many of the animals of the 
30-second group ceased to operate the bar after varying numbers 
of trials; because of this the results of these animals could not be 
employed in the plotting of Perm’s gradient of reinforcement. 

The results of this experiment which are of special interest in 
the present context are shown graphically in Figure 32. There it 
may be seen that the rates of learning, as indicated by the slopes 
of the learning curves of the several groups of animals at the 50 per 
cent level of correctness, show a negatively accelerated descending 
gradient much as does Wolfe’s study (Figure 31). There is, how- 
ever, this difference: the slope of Perm’s curve is of such a nature 
that, when extrapolated, it falls to zero at 34 seconds ; with a some- 
what different method of plotting the learning curve from which 
the tangents were taken, the extrapolated gradient falls to zero 
at 44 seconds. 

THE EECX>2SrCILIATIOK OF SOME EXPERIMENTAL PARADOXES 

The most outstanding paradox encountered among the experi- 
mental results outlined above is the fact that the Watson study 
and a comparable one by Warden and Haas {13) show no gradient 
of reinforcement, whereas the Hamilton investigation and, particu- 
larly, that of Wolfe clearly indicate such a gradient. As already sug- 
gests, the difference in the outcomes of the two groups of studies 
is probably to be attributed mainly to the fact that in the first 
two the Intention or delay occurred in the food chamber, whereas 
in the latte* two it took place in a separate (non-feeding) com- 
partment. 

Actually, on the basis of secondary reinforcement, we should 
expect exactly such a difference in the outcome of the two groups 
of experiments. It may be recalled (Chapter VII, p. 97) that 
stimulus which is closely and consistently associated with a 
reinforcing s^te of affairs will itself gradually acquire the power 
of ^cmdarj reinfcn^emmt regardless of whether the transmitting 
^mulus m primarily or secondarily reinforcing. Thus in the Wat- 
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son experiment and also in the Warden and Haas study the food 
chamber would largely have acquired this power from the prelimi- 
nary or habituation training for both groups of animals alike. By 
the same principle, the odor of the food which came through the 
perforations in the food-cup lid in both experiments had already 
acquired secondary reinforcing power as the result of the animals' 
having eaten the food, even before habituation to the food chamber. 
There was therefore, on two counts, no delay in the effective (sec- 
ondary) reinforcement in either group and so, naturally, it is not 
to be expected that a delay of 30 seconds in the actual eating 
would produce a retardation in the learning rate large enough to 
be detected by the use of small groups of animals. 

Turning, now, to the Hamilton and Wolfe studies we still find, 
even with the separate retention chamber, the conditions which are 
both necessary and sufiBcient for secondary reinforcement, though 
in an attenuated form. In these investigations the eating of the 
food is immediately associated with the food cup and the food 
chamber by both groups alike, so that all parts of the food cham- 
ber as stimuli must gradually acquire secondary reinforcing powers 
as the trials increase in number. Next, since the stimuli arising 
from the door giving access to the food chamber are associated 
closely with the food chamber itself, this door must gradually 
acquire secondary reinforcing powers following the acquisition of 
these powers by the food chamber as a whole. In a similar manner, 
the entire retention chamber would gradually acquire a measure 
of secondary reinforcing power from reinforced association with 
this door, though presumably the longer the delay there, together 
with the incidental irrelevant activity during the periods of delay, 
the slower would be this process. This probably explains the fact 
that Hamilton's animals continued to learn with fair speed under 
delays of reinforcement up to seven minutes, and Wolfe's animals, 
imder delays up to twenty minutes. 

Moreover, considering the simplicity of Wolfe’s maze, which 
involved only a single choice, his animals learned notably more 
slowly than did Hamilton's, which were required to learn five 
choices. This paradox quite probably was due in part to the fact 
that Wolfe employed a retention chamber on the false choices as 
well as on the correct ones. Presumably the two retention cham- 
bers were physically identical. Under these conditions the secon- 
dary reinforcing power acquired by the retention chamber next to 
tile food box would generalize (p. 183) to the one outside the non- 
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reward box, and so tend at first to reinforce the false choice and 
thereby retard the learning. In addition, the extinction effects aris- 
ing from the non-reinforcement of the incorrect final choices w-ould 
generalize (p. 264) to the correct choice to some extent, thereby 
weakening that reaction and still further retarding the learning. 
Also presumably contributing to the relative slowness of the learn- 
ing of Wolfe^s animals is the fact that inside the retention chamber 
on the non-reward arm of the maze w^as a dish containing food 
from w-hich the animal was excluded by a wire-screen cover. In 
all probability the odors and other associated stimuli arising from 
this non-eating food situation constituted powerful secondary rein- 
forcing influences, as in the Grindley experiment (p. 126), which 
also would tend to reinforce the false choice until extinction should 
supervene. 

Finally, there is reason to believe that in all of the above 
studies, but especially in the Wolfe study, there enters the compli- 
cating factor of spatial orientation. This is the capacity possessed 
by most organisms to return to a point of reinforcement if the dis- 
tance is not too great. The mechanisms mediating this behavior 
are complex and cannot be taken up in this place. 

The conditions of Perin^s experiment, on the other hand, were 
designed in such a way as to preclude all irrelevant secondary rein- 
forcement as well as all irrelevant spatial orientation. This pre- 
sumably accounts for the fact that (1) the extrapolation of Perin's 
gradient falls to zero, whereas the extrapolation of Wolfe^s gradient 
lacked much of doing this; and (2) Perm’s gradient reaches zero 
at a delay of between 30 and 40 seconds, whereas Wolfe’s gradient 
continues to fall for 20 minutes, and the asymptote of this fall lacks 
much of being zero. 

FOEMUIAIION OP THE GOAL GRADIENT HYPOTHESIS 

As we have just seen, Perm’s investigation suggests that, uncom- 
plicated by irrelevant factors, the basic temporal gradient of habit 
stroigth as a function -of the delay in reinforcement in the case 
of ihe albino rat actually extends over a relatively short period of 
time, possibly no more than 30 seconds and very probably less than 

^^nds. On ihe other hand, the Wolfe study indicates that 
und^ ordinary learning conditions, where plenty of opportunity 
for i^eottdary rdnforcement usually exists, the gradient may extend 
in iMM^derable force for a relatively long period. This means that 
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what was originally regarded as a single principle has turned out 
upon intensive investigation to involve two fairly distinct prin- 
ciples: (1) the short gradient reported by Perin, which will be 
called the gradient of reinforcementj an expression coined by Miller 
and Miles (9) ; and (2) the more extended gradient which is pre- 
sumably generated as a secondary phenomenon from Perin's gradi- 
ent of reinforcement acting in conjunction with the principle of 
secondary reinforcement. This second and more extended gradient 
may with some propriety retain the original name of the goal 
gradient, an expression employed by the present author in his first 
discussion of the subject (6). 

Unfortunately it is not yet clear in exactly what quantitative 
manner the basic gradient of reinforcement, in combination with 
secondary reinforcement, generates the more extended goal gradient. 
We dOy however, have a number of promising leads. Because of 
the intimate relation knowm to exist betw^een the conditioning of 
a stimulus to a reaction and the acquisition by that stimulus of 
^condary reinforcing power (p. 100), it is plausible to assume 
that stimuli acquire this power according to the primitive gradient 
of reinforcement demonstrated by Perin. It is also assumed that 
once a stimulus has acquired a certain amount of the capacity for 
secondaiw" reinforcement, this will immediately begin to operate 
according to Perin's primith’-e gradient of reinforcement to rein- 
force antecedent receptor-effector connections and to endow each 
newiy associated receptor process with secondary reinforcing 
powders, and so on. Thus the goal gradient would result from the 
summation of an exceedingly complex series of overlapping gradi- 
ents of reinforcement, in part consisting of, but largely derived 
from, the ^^primary” reinforcement occurring at the end of the 
temporal period covering the behavior sequence involved. Also, the 
generation of overlapping secondary gradients presumably would 
take place under ordinary learning conditions not only beyond the 
range of Perin^s primitive gradient of reinforcement, but within its 
range as well, so that the period preceding ‘^primary” reinforcement 
by 30 or 40 seconds would pr^ent a picture not very different from 
other portions of the total range, though "Wolfe^s results (Figure 
30 and 31) suggest that the characteristics of the goal gradient for 
the first minute of the delay in reinforcement may differ somewhat 
from the more remote portions. In view of the all but universal 
prevalence of tiie conditions w^hich generate secondary reinforce- 
ment, coupled with the great difficulty experienced by Perin in 
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eliminating them from his experiment, it is fairly evident that the 
principle immediately concerned in ordinary learning situations is 
what we have called the goal gradient, whereas both the gradient 


TABLE 2 

Tms Table Shows the Theoretical Habit Strengths in Habs op Re- 
ceptor-Effector Conjunctions With Unlimited Practice When the Reac- 
tion Is Followed by Reinforcement After Varying Amounts of Delay 
IN Seconds (i). It Is Assumed That With Zero Delay the Reinforcing 
Agent Employed Would Yield a Habit Strength (M') of 80 Habs at the 
Limit op Practice, and That Each Additional Second op Delay Reduces 
THE Limit of Hab it Strength by 1/65th. 


Amount of 
Delay 
(0 

Habit- 

Strength 

Limit 

(mO 

Amount of 
Delay 
(0 

Habit- 

Strength 

Limit 

(m') 

Amount of 
Delay 
(t) 

Habit- 

Strength 

Limit 

im') 

0 

80.00 

30 

50.29 

60 

31.61 

1 

78.77 

31 

49.52 

65 

29.26 

2 

77.56 

32 

48.76 

70 

27.08 

3 

76.37 

33 

48.01 

75 

25.07 

4 

75.20 

34 

47.27 

80 

23.20 

5 

74.04 

35 

46.55 

85 

21.48 

6 

72.91 

36 

45.83 

90 

19.91 

7 

71.79 

37 

45.13 

95 

18.39 

8 

70.68 

38 

44.44 

100 

17.02 

9 

69.60 

39 

43.75 

105 

15.76 

10 

68.53 

40 

43.08 

110 

14.58 

11 

67.48 

41 

42.42 

115 

13.50 

12 

66.44 

42 

41.77 

120 

12.49 

13 

65.42 

43 

41.13 

125 

11.56 

14 

64.42 

44 

40.50 

130 

10.70 

15 

63.43 

45 

39.87 

140 

9.17 

16 

62.46 

46 

39.26 

150 

7.85 

17 

61.50 

47 

38.66 

160 

6.73 

18 

60.45 

48 

38.07 

170 

5.76 

19 

59.62 

49 

37.48 

180 

4.94 

W 

58.71 

50 

36.90 

190 

4.23 

21 

57.81 

51 

36.34 

200 

3.62 

22 

56.92 

52 

35.78 

210 

3.10 


56.04 

53 

35.23 

220 

2.66 

24 

55.18 

54 

34.69 

230 

2.28 

25 

54.34 

55 

34.16 

240 

. 1.95 

26 

53.50 

56 

33.63 

270 

1.23 

27 

52.68 

57 

33-12 

300 

.77 


51.87 

58 1 

32.61 

330 

.49 

29 

51.07 

59 

32.11 

360 

.31 


of remforcement and secondary reinforcement are represented in 
tbe goal gradioat, which they presumably generate. 

In the inters of immediate utilization we accordingly proceed 
to tile consideration of the more detailed characteristics of the goal 
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gradient. On the basis of the direct experimental approaches out- 
lined above, together with certain indirect approaches presently to 
be disclosed, we formulate our hypothesis concerning the molar 
functional relationship of habit strength to the temporal delay in 
reinforcement as follows: (1) The maximum habit strength {m') 
attainable with a given amount and quality of reinforcement closely 
approximates a negative growth function of the time (t) separat- 
ing the reaction from the reinforcing state of affairs; (£) the asymp- 



Fig. 33- Theoretical goal gradient plotted from the values ^own in Table 
2. This ciir\’e is drawn on the assumption that the value of habit strength at 
the limit of practice is reduced l/65th for each additional second of delay 
in reinforcement. The braces on the vertical sc^le ^ow the theoretical 
difference in habit strength which would be produced by each at the limit of 
piactice. Note (1) that none of these differences is like any of the otha^ 
and (2) that the middle difference is the greatest of the three. 

tote or limit of fall of this gradient is zero; and (S) the more 
favorable the condition for the action of secondary reinforcement, 
the slower will be the rate of fall, so that this limit may not be 
approximated until after considerable periods of delay, thowgh for 
many conditions less favorable for secondary reinforcement it may 
be reached in a period of from SO to 60 seconds. 

For purposes of precise illustration, the values of such a nega- 
tive growth function have been calculatal and are reproduced as 
Table 2, on the assumption that the conditions of secondary rein- 
forcement are such as to bring the gradient practically to zero at 
a delay in “primary” reinforcement of about six minutes. The 
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values appearing in this table are represented graphically by the 
curve shown in Figure 33. 

With a definite hypothesis as to the quantitative character- 
istics of the goal gradient available, the possibility at once arises 
of deriving from it, and the other principles of the system, numerous 
implications in the form of corollaries. A comparison of these 
deductions with relevant experimental evidence then affords a basis 
for the acceptance, rejection, or further modification of the hy- 
pothesis. With good fortune, this indirect procedure, supplementing 
that of the direct experimental attack already considered, may be 
expected to lead to the isolation of a sound and comprehensive 
scientific principle more quickly than would either approach em- 
ployed alone. 

ORGANISMS GRADUALLY ACQUIRE A PREFERENCE FOR THE ACT 
FOLLOWED BY THE SHORTER DELAY IN REINFORCEMENT 

Let it be assumed that hungry albino rats are presented with a 
choice of two short passageways; at the end of each is an exactly 
similar food reinforcement. In both alleys alike there is, next to 
the food box, a delay chamber in which the animal is retained for 
a certain period before being permitted access to the food. More 
specifically, let it be assumed that the delay in one chamber is 
30 seconds and that the delay in the other is 60 seconds. Finally, 
durii^ training one or the other alley is always blocked, the order 
of the reinforcements on the respective alleys being randomized in 
such a way as to keep approximately the same number of rein- 
forcements on each at all times. Therefore, at any point in the 
training at which it is desired to know the relative strengths of the 
two habits thus set up, the entrance to both alleys may be left open, 
at which time a competition between the two habit strengths will 
occur, the stronger habit of course dominating. By training groups 
of comparable animals with different numbers of reinforcements 
before the testing, it would be possible to determine relatively how 
strong the two habits are at various stages of training, and in this 
indirect manner verify empirically the goal gradient hypothesis 
formulated above. 

Table 2 shows that the theoretical limit (m') of habit strength 
at a delay of 30 ^onds is 50.29 habs, and the limit at 60 seconds 
m 31.61 habs. With ihese asymptotes of the positive growth, or 
laming, curves available, and taking our fractional increment (F) 
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TABLE 3 

Columns 2, 3, and 4 of This Table Show the Theoketical Habit Stbength 

AT THE SuCCESSI'VT: REINFORCEMENTS BY THE SaME REINFORCING AGENT WhERB 

THE Fractional Increment (F) Is 1/20, and Where the Delay in Rein- 
forcement Is 30 ^ 60^, AND Respectively. The Values Employed 
IN the Computation of These sHr Values Were Taken from Table 2. 
The Values in Columns 5 and 6 Represent the Pee Cent of Test Trials 
IN Which the Habit With the Shorter Delay in Reinforcement Would 
Be Expected to Dominate on the Assumption That the Range of Oscilla- 
tion Has a Standard Deviation of 13 Habs. 


Ordinal 
Number of 
Reinforce- 
ment 

= Strength of 
sHr in 
Habs at 30*' 
Delay in 
Reinforce- 
ment 

; Strength of 
sHr in 
Habs at 60*" 
Delay in 
Reinforce- 
ment 

1 ^ 

• Strength of 

1 sHr in 

1 Habs at90*' 
Delay in 
Reinforce- 
ment 

j Per Cent 
of Trials 
at Which 
the 30 
Habit 
Dominates 
Over 60*^ 

Per Cent 
of Trials 
at Which 
the 30^ 
Habit 
Dominates 
Ch^er 

1 

2 


4 

5 

6 


0.0 

0.0 

0.0 

50.0 

50.0 

1 

2.5 

1.6 

l.C 

52.0 

53.3 

2 

4.9 

3.1 

1.9 

53.9 

56.4 

3 

7.2 

4.5 

2.8 

55.8 

59.3 

4 

9.3 

5.9 


57.5 

62.0 

5 

11.4 

7.2 


59.1 

64.6 

6 

13.3 

8.4 


60.6 

66.9 

8 

16.9 

10.6 


63.4 

71.1 

10 


12.7 

8.0 

65.8 

74.6 

12 

23.1 

14.5 

9.2 

68.0 

77.6 

lo 

27.0 

17.0 

10.7 

70.7 

S1.2 

18 

30.3 

19.1 


73.0 

84.0 

21 

33.2 

20.8 

13.1 

74.9 

86.2 

25 

36.3 

22.8 

14.4 

76.9 

88.4 

30 

39.5 

24.8 

15.6 

78.8 

90.3 

35 

41.9 

MA 

16.6 

80.2 

91.6 

40 

43.8 

27.5 

17.4 

81.2 

m.5 


at l/20j there may generated the progressive learning values 
shown in eolmnns 2 and 3 of Table 3. For purposes of ready com- 
parison, these learning curv^ are represented graphically in Fig- 
ure 34. It will be noted at a glance that the ^i^ond curve 
gradually rises above the 60-second curve, the di^ance separating 
them increasing as the number of reinforcements increa^. This 
fields our first corollary: 

1. The shorter the delay in reinforcement, the steeper becomes 
the rise of the associated curve of learning. 

To be of critical scientific value, a theoretical deduction should 
lead to the po^ibility of comparison with a relevant empirical 
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observation. Unfortunately, Corollary I as it stands does not 
permit an observational check because habit strength, as such, 
cannot be observed. However, by combining it with the well- 
known principle that the greater the habit strength, the shorter will 
be the time of reaction evocation (p. 105) , we easily derive a second 
corollary which is susceptible of such verification: 

II. When a reaction is reinforced after a short delay, the time 
required to execute the act will be less than that required to execute 



Fig. 34. Parallel theoretical learning curves with the same rate of rise 
(F = 1/20) but with different asymptotes as determined by the respective 
periods of delay in reinforcement (30 and 60 seconds) as shown in Table 2. 
The above curves were plotted from columns 2 and 3 of Table 3. 


a comparable act which has had the same number of reinforcements 
but in which the delay of the reinforcements has been longer. 
Empirical confirmation of the essential soundness of Corollary 2 
is furnished by a number of studies, notably one reported by Ander- 
son (f). This experiment was set up {1) in substantially the 
manner postulated in the theoretical arrangement just described, 
with the ^ception that the animals were permitted to choose freely 
bet^mi the two entrances throughout the training process; the 
inw^fetgator’s labor was thereby greatly economized. Early in the 
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training this procedure would, of course, begin to give a dispro- 
portionately large number of reinforcements to the reaction asso- 
ciated with the shorter delay. While this doubtless weakened, 
relatively, the act involving the longer delay in reinforcement, it 
probably did not materially change the outcome so far as the 
present set of corollaries is 
concerned. 

Anderson’s animals had 
to cross a platform on the 
way to the retention cham- 
bers, the distance amount- 
ing to seven feet. It hap- 
pens that in this study two 
pairs of delays are reported 
which are alike in their 
ratio (1:3), and closely sim- 
ilar in their empirical dis- 
criminability (83 per cent 
and 80 per cent after 40 
reinforcements), yet the 
lengths of the delays are 
very different: 10 and 30 
seconds as compared with 
120 and 360 seconds, the 
lengths of the second pair 
of delays being twelve times 
those of the first. The mean 
runway tim^ for each of 
Ihe eight days of training 
of the resistive groups of 
animals are shown in Figure 35, where it may be seen at a glance 
that the acts associated with the pair of short delays have a lower 
mean reaction time throughout the entire training period, which 
thus agrees with the corollary. 

At this point we must introduce a factor to be taken up in 
detail a little later — ^that of the spontaneous oscillation or vari- 
ability in habit strength (p. 304 ff.). It will be sufficient here to 
say only that there is reason to believe that the effective strength 
of all habits when functioning as reaction potentials is subject to 
continuous uncorrelated interferences, presumably mainly from 
proc^es arising spontaneously within the nervous system, and that 



Fig. 35. Parallel curves showing differing 
locomotor times for the same distance, 
training, and discriminability, but markedly 
different delay in reinforcement. (Plotted 
from Anderson^s published Table 1, p- 
424.) 



the magnitudes of these disturbances are distributed approximately 
according to the “normal” law of probability. On the assumption 
that the oscillations in habit strength are largely uncorrelated, it 
follows that the weaker of two competing habit tendencies will fre- 
quently be dej^essed only slightly when the stronger one chances 
to be depressed relatively much, with the result that the weaker 
habit will dominate on that occasion. Indeed, this failure of the 
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There it will be seen that, theoretically, at first the two choices 
are equally likely, i.e., the entrance to the alley leading to the 
chamber associated with the 30-second delay should occur on only 
50 per cent of the test trials because the two probability distribu- 
tions coincide exactly. However, as practice progresses the choice 
associated with the shorter delay gradually attains an advantage 



Fig. 37 . Gmph ghowing empirical curi^es of increasing preferences for the 
reaction involving the shorter of two delays in reinforcement. (Plotted from 
tables published by Andei^n, J.) 


which, after 40 pairs of reinforcements, reaches the considerable 
amount of 81.2 per cent. 

On the basis of the above calculations we generalize and formu- 
late the following corollaries: 

III. With training j organisms tend to choose that one of a pair 
of alternative acts which yields reinforcement with the lesser delay, 

IV. The preference for that one of a pair of acts involving the 
lesser delay in reinforcement is attained gradtudly as training in- 
creases. 

Empirical confirmation of Corollaries III and IV is seen in the 
study reported by Anderson (f). This investigator found that in 
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the course of 40 trials involving delays of 30 and 60 seconds respec- 
tively, eight albino rats displayed a gain in the choice of the act 
involving the 30-second delay which extended from 47 per cent 
(approximately chance) to 76 per cent. The curve of this learning 
is shown in the lower graph of Figure 37. Moreover, DeCamp (3), 
Yoshioka (f7), and Grice U) have all found substantially the 
same relationship to hold where the delay in reinforcement was 
incidental to the difference in length of two alternative paths lead- 
ing to a reinforcement. It is noteworthy that Grice’s experiment 
was set up in such a way as to keep the number of reinforcements 
on the two paths more nearly equal than was the case in any of 
the other studies so far reported; his results are therefore more 
comparable to the conditions presupposed by the above deductions. 

THE BATE OF DISCRIMINATION OF ACTS AS A FUNCTION (1) 01 
THE DIFFERENCE IN THE DELAYS INVOLVED AND (2) OF 
THE ABSOLUTE MAGNITUDE OF THE DELAYS 

Our next problem concerns the relative rate of preference acqui- 
sition of alternative reactions involving differential delays in rein- 
forcement as a function of the amount of difference in the two 
delays. Let us take, for example, the same theoretical arrangement 
assumed above, with the exception that the delays are of 30 and 
90 seconds respectively, which gives the organism the relatively 
coarse ratio of 1 to 3 instead of 1 to 2, as previously. The theo- 
retical course of the acquisition of habit strength with a 30-second 
delay in reinforcement may be seen in the second column of 
Table 3, and that of a 90-second delay, in the fourth column. The 
I^r cent of the trials in which the act associated with the 30-second 
delay of reinforcement would be expected to dominate over that 
a^ciated with the 90-second delay is given in the sixth column 
and is represented graphically in the upper (solid) curve of Fig- 
ure 36. There it may be seen at a glance that the larger difference 
yields the more rapid acquisition of dominance by the act involving 
the lesser delay of reinforcement. Specifically, at the fifteenth 
reinforcement the discrimination involving the 30-90-second delay 
reached the level of dominance attained at the fortieth reinforce- 
ment by the one involving the 30-60-second delay. Generali25ing, 
we arrive at our fifth and sixth corollaries: 

V. Other things equal, the greater the difference in the delay of 
remforcmnmt of tmo competing reactions , the less mill he the train^ 
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ing required to give the act involving the lesser delay a given degree 
of dominance, 

VI. Other things equal, the coarser the ratio of the delay of 
reinforcement of two competing reactions, the less will be the train^ 
ing required to give the act involving the lesser delay a given degree 
of dominance. 

Corollaries V and Y1 also find ready empirical verification in 
the investigation reported by Anderson (Jf). For purposes of easy 
comparison, the experimental learning scores of Anderson’s 30-60- 
second group of animals (a ratio of 1 to 2) are presented graphi- 
cally in Figure 37, in parallel with the results from his 30-90-second 
group ( a ratio of 1 to 3). While somewhat irregular, as is to be 
expected from the relatively small number of animals employed 
in the respective groups, the general relationship shown by the two 
theoretical curves of Figure 36 is discernible. The 30-90-second 
discrimination reached, between 15 and 20 reinforcements, a degree 
of short-delay dominance only attained by the 30-60-second com- 
bination between the 35th and 40th reinforcements. Completely 
parallel results were obtained by Y^oshioka (17) in the discrimina- 
tion of pairs of alternate paths to a goal. He reports that a given 

TABLE 4 


Table Showing the Per Cent op Choices of Acts Associated With the 
Shorter of Two Delays to Be Expected on Theoretical Grounds and 
the Parallel Empirical Values Reported by Anderson (i, p. 54). 


Ratio 

j Delays of 
i Reinforcement 

I Compared 

Theoretical Per Cent 
Choices of Act 
Involving Shorter Delay 

Empirical Per Cent 
Choices of Act 
Involving Shorter Delay 

1 :3 

120L*36D' 

71.8 

82 

1 :3 

60‘’:180' 

89.7 

87* 

1 :3 

30^:90*^ 

92.5 

89 

1 :3 

10":30^ 

80.6 

SO 

1 :2 

120^:240^ 

69.1 

70 

1 :2 

60^:120^ 

81.8 

85 

1 :2 

30^:^^ 

81,2 

76 

1 :2 

10 ^: 20 ’^ 

67.9 

74 

1 : 1.5 

120*':180*' 

64.0 

63 

1 : 1.5 


71.0 

66 

1 : 1.5 


68.9 

* • * 

1 ; 1.5 

10^:15*^ 

59.6 

... 


* At point in Andea^n^s article this value is given as 84 i»r cent, but in the original table, 
as wen as in his thesis on file in Yale University, the mean of wliich has been checked, the value is 
Sr per cent. 




154 


PRINCIPLES OF BEHAVIOR 


amount of training on two alternative alleys 210 and 233 inches 
in length (a difference of 23 inches and a ratio of about 1 to 1.11) 
yielded a preference for the shorter path of 10.80 units, whereas 
when the 210-inch alley was paired with a 276-inch alley (a dif- 
ference of 66 inches and a ratio of about 1 to 1.3), the same amount 
of training yielded a preference for the shorter alley of 18.85 units. 

Our next problem concerns the relative ease of discriminating 
acts involving a given amount of difference in the delay of rein- 
forcement as dependent upon whether the length of the shorter of 
each pair of delays is absolutely short or long. For example, what 
does our hypothesis imply as to the ease of differentiating acts 
associated with delays of 30 and 60 seconds as compared with 
those of 60 and 90 seconds when both pairs of delays involve ex- 
actly the same absolute difference, 30 seconds? We have already 
seen (Table 3 and Figure 36) that theoretically 30 and 60 seconds 
yield at 40 reinforcements 81.2 per cent choice of the 30-second 
act- Table 4 shows that at 40 reinforcements the 60-second choice 
over the 90-second choice, theoretically, occurs on 71-0 per cent of 
the trials. But, si o 71 n 


Generalizing, we arrive at our seventh corollary: 

VII- Equal differences in the delays of reinforcement of two 
competing acts lead with equal practice to a lower per cent of 
preferential choices of the act associated ivith the shorter delay 
when the two delays are large {i.e., when the ratio is relatively 
coarse) than when they are small {i.e,, when the ratio is finer). 

This corollary also finds complete empirical confirmation. In 
the Anderson study already cited, it was foimd (column 4, Table 3) 
that at 40 reinforcements delays of 30 and 60 seconds gave 76 
cent preference for the 30-second act, whereas delays of 60 and 
90 seconds gave a preference of only 66 per cent to the 60-second 
act. Similarly, Grice found that alternative alleys of six feet and 
twelve feet, with a six-foot difference, were discriminated in a 
mean of 12.4 trials, whereas alleys of 24 feet and 30 feet with 
exactly the same difference were discriminated only after a mean 
of 28.4 trials, i.e., after about twice as much training. 


THE EEIATIOH OP THE DELAY IN EEirsTFORCBMElsrT 
TO WEBEit’S LAW 

Experimaital results such as those just summarized under Corol- 
laries VI and Vn have led to the view that the- discrimination 
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between inten^ais of delay in reinforcement as found by Anderson 
(ij and between altematix’^e paths to reinforcement as found by 
Yoshioka [171 indicates the conformity of these learning processes 
to Weber's law. Indeed, influenced largely by the views of Y^oshi- 
oka, the present writer first postulated the gradient of reinforce- 
ment as being a logarithmic function of the delay of reinforcement, 
w’hich is implicit in the Weber s law hypothesis; further considera- 
tion, which revealed certain mathematical paradoxes arising from 
the nature of the logarithmic function (7, p. 273) led to its aban- 
donment in favor of the exponential or negative growth function 
represented in Table 2 and Figure 33. 

This problem brings us to our eighth corollary. Weber’s law, 
as applied to the delay in reinforcement, requires that all pairs of 
delays of equal ration, e.g., 1 to 2, with equal amounts of training, 
yield equal per cents of preference for the act associated with the 
shorter delay. The pr^nt (exponential) hypothesis leads to quite 
different expectations. Suppose that we have four pairs of acts, 
all with delays in the ratio of 1 to 2 as follows; 

1(K' vs. aK'; 20" vs. 60"; 60" vs. 120"; 120" vs. 240". 

By Table 2, these pairs of delays at the limit of practice generate 
the following habit strengths: 

68.53:58.71; 50.29:31.61; 31.61:12.49; 12.49:1.95. 

Appropriate computations based on the same assumptions as out- 
lined above show that at 40 reinforcements the respective situations 
would generate the following pairs of habit strengtfis: 

59.71:51.15; 43*81:27.54; 27.54:10.88; 10.88:1.70. 

Further calculations show that these pairs of habit strengths cor- 
respond to the following per cents of preference for the shorter 
delays: 

10":20" =67.9 per cent 
30":^" =81.2 per cent 
60":120" = 81.8 per cent 
120":240" = 69.1 per cent 

Comparable computations have been made for a series of delays 
which stand in a coarser ratio of 1 to 3 and in a fi,ner ratio of 1 to 
1.5. The theoretical outcome of all three sete of delays has been 
drawn up systematically in Table 4. 



PRINCIPLES OF BEHAVIOR 


156 

An examination of the third column of this table shows that 
according to the present set of hypotheses, it is not to be expected 
that all pairs of acts whose delays of reinforcement stand in the 
same ratio will be equally discriminable. On the contrary, it is 
evident that while the several entries under a given ratio show some 
resemblance, they also show marked differences. Moreover, these 
differences manifest a characteristic pattern. First, there is a cen- 
tral point of maximum discriminability from which the ease of dis- 
crimination decreases as the absolute magnitude of the delays either 
increases or decreases; thus in the ratio of 1:3 the combination of 
30-90 seconds gives a maximum of 92.5 per cent, whereas that of 
60-180 seconds, *a pair of larger values, gives 89.7 per cent, and that 
of 10-30 seconds, a pair of smaller values, gives 80.6 per cent 
Second, as the ratio of the delays grows finer, the point of maxi- 
mum ease of discrimination shifts in the direction of the longer 
delays; thus the maximum ease of discrimination falls in the 1:3 
ratio on the combination of 30-90 seconds, whereas at the 1:2 ratio 
it falls on the larger values of 60-120 seconds. 

These a 'priori expectations find striking empirical confirmation 
in the experimental results reported by Anderson (1). The rele- 
vant empirical values have been arranged in column 4 of Table 4, 
in parallel with the theoretical values. There it may be seen that 

(1) the discriminability of acts associated with different delays in 
reinforcement is in fact not equal, i.e., Weber’s law does not hold; 

(2) the combination 30-90 seconds has the maximum ease of dis- 
criminability of the 1:3 ratio exactly as calculated, from which the 
ease of discrimination falls off in both directions; (3) the combina- 
tion 60-120 seconds has the maximum ease of discriminability of 
the 1:2 ratio, a shift in the direction of the larger values, from 
which the ease of discrimination diminishes in both directions, 
again exactly as demanded by the theory. 

A similar outcome in favor of the exponential and opposed to 
the logarithmic relationship was obtained by Grice (4) in the ease 
of discriminating different ratios of alternate paths to food rein- 
forcement. While Grice^s experiment was not designed in such a 
way as to bring out the exponential relationship as dramatically as 
do^ Anderson’s study, this characteristic in his results was effec- 
tively demonstrated by means of a process of curve fitting. 

Gaieralizing from the preceding observations, we may formulate 
our eighth corollary as follows: 
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■^TII. The ease of discrimination of acts associated with dif^ 
ferent delays of reinforcement has for a given ratio of delay lengths 
a maximum occurring at a central point of absolute lengths, from 
which point it diminishes progressively with both increase and de- 
crease in the absolute magnitude of the delays, the point of maxi- 
mum ease of discrimination shifting in the direction of the longer 
delays as the ratio of the two delays grows finer. 

Extrapolating from the same general considerations, we may 
formulate a ninth corollary: 

IX. With the ratio between delays of reinforcement constant, 
discrimination becomes impossible when the periods of delay in- 
volved become sufficiently great or sufficiently small. 

SUMMAEY 

The favorite methcKl employed by experimentalists in determin- 
ing the functional relationship of the mte of learning to the delay 
in reinforcement has been to pre^nt an organism with the pos- 
sibility of performing two alternative acts one of which receives 
reinforcement after a shorter delay than the other. The organism 
is then permitted gradually to develop a preference for the act 
involving the shorter delay by a process of trial and error, the 
number of trials required to reach a given preference criterion being 
taken as an indication of the ease of the ^‘discrimination.” In the 
purer forms of these experiments the acts are strictly comparable, 
e.g., merely turning to the right or the left and walking a few inches 
or feet. The alternative acts of some experiments, however, require 
the organism to traverse differaat lengths of paths to reach the 
point of reinforcement, in which case not only is there presumably 
m%"olved the delay required to traverse a given path, but there is 
introduced a third, though closely parallel factor, the amount of 
work or energy expenditure required in traveising the different 
distances (see p. 280 ff.). 

The early experimental work on the relation of the rate of 
learning to the extent of the delay in reinforcement yielded nega- 
tive results. This 'was apparently because in the procedures em- 
ployed, the food box served as the' retention chamber, which intro- 
duced in a gross manner the factor of secondary reinforcement. 
When a separate compartment adjoining the food box was used 
as a retention chamber, the spread of secondar}^ reinforcing ten- 
dencies was sufficiently gradual to show in one study, that of 
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Wolfe, a negatively accelerated falling gradient extending to delays 
of reinforcement 20 minutes in duration. Perin, on the other hand, 
in an experiment set up in such a way as to exclude secondary 
reinforcement as completely as possible, found a negatively accel- 
erated gradient of rate of learning which fell to zero at a delay of 
only 30 seconds or so. 

The most plausible interpretation of these superficially conflict- 
ing results seems to be (1) that the basic or primary gradient of 
reinforcement is only about 30 seconds in duration, and (2) that 
under ordinary learning conditions secondary reinforcement com- 
bines with the gradient of reinforcement to produce a derived phe- 
nomenon which may be called the goal gradient. 

Partly on the basis of direct experimental evidence, but mainly 
on the grounds of indirect evidence, it is concluded that the most 
plausible hypothesis concerning the quantitative characteristics of 
the goal gradient is: In situations where both (1) the primary 
gradient of reinforcement and (2) the principle of secondary rein- 
forcement are operative in a progressive manner, the goal gradient 
is an eTponential or negative growth function, and the greater the 
influence of secondary reinforcement, the less steep will be the slope 
of the gradient 

From this postulate or hypothesis, together with other principles 
of the systen, a number of corollaries follow. The first of these is 
that, other things equal, the shorter the delay in the reinforcement 
of a given act, the steeper will be the curve of learning of that act 
and so, at any given number of reinforcements, the shorter will be 
the time required to execute the act; this is in agreement with 
experimental findings. 

In the typical trial-and-error situation involving alternative 
acts with different delays in the reinforcement of each, it follows 
from the goal gradient hypothesis as here formulated that the act 
a^^iated with the shorter delay of reinforcement will acquire 
habit strength at a faster rate than will the alternative act. How- 
ever, because of the principle of oscillation, the stronger of the 
two habits would be expected to attain dominance, not at once but 
only gradually, and often imperfectly, even after a very large num- 
ber of trials. This a priori expectation is confirmed by experiment. 

On the same principle it is to be expected that, other things 
equ^, the larger the difference in the delays associated with the 
(^mpetang acts, the fewer will be the trials required to produce a 
cmt of domin^ce. Similarly, for a given difference in 
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the delays associated with the respective acts, the longer the actual 
delays involved, the greater will be the number of trials required 
to produce a given per cent of dominance. Both these corollaries 
also are confirmed by experiment. 

Perhaps the most critical test of the hypothesis concerns the 
ease of learning to discriminate comparable acts associated with 
delays of equal relative, but of different absolute, duration. The 
exponential goal gradient hypothesis as here formulated implies 
that the discriminability of acts associated with delays of reinforce- 
ment standing in the same ratio to each other will be maximal at 
a central region of absolute delays, but at other delays, whether 
greater or less, the ease of discrimination declines progressively. 
Moreover, it follows from these same principles that the point of 
maximiim ease of discrimination of a coarse ratio, such as 1 to 3, 
W’iil appear at smaller absolute valu^ than will be the case with 
a fine ratio, such as 1 to 2. All of these intricate and detailed im- 
plications of the exponential goal gradient hypothesis are in re- 
markable agreement with the experimental observations at present 
available. This fact gc^ far to indicate that the goal gradient is 
a negative growth, or exponential, function. Incidentally this fur- 
nishes an excellent e.xample of the manner in which indirect pro- 
cedures may sometimes yield the characteristics of a function 
which has proved refractory to direct attack. 


NOTES 
Historical Note 

The first mentioii of the gradient-of-remforcement lainciple which we have 
been able to find was by Thorndike, in 1913. At that time he wrote: “Such 
intimacy, or cic^ne^ of connection between the satisfying state of affairs and 
the bond it affecte, may be due to clcse temporal sequence. . . Other things 
being equal, the same degree of satisf^ungn^ will act more strongly on a bond 
made two seconds previously than on one made two minutes previously. . 
illy pp. 172-173). Thirteen years later, Margaret Washburn remarked in the 
third edition of The Animal Mind: “'The facts show that it [the drive] will set 
mc^ stron^y in reading thc^ movements which mc^t immediately preceded 
its Isolation on a previous occasion. This 'gradient* of excitation from move- 
mente just before the final 'succe^’ step by step to those at the beginning of the 
i^ri^, may also be explained by the ordinary associative laws. The movements 
near^ the end of the seri^ have a greater reaxiiness due to recency of perform- 
(14 P. 335). 

In 1932 the same general (»n<^pt was put forward by the present writer in 
an attempt at a deductive explanation of numerous molar phenomena of animal 
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I learaiiigj such ^ the preference of the shorter path to a goal, the fact of 
Wind ailej elimination, the backward order of such eliminations, and so on: 
'"The mechanism which in the present paper will be mainly depended upon as ah 
exphriatory and integrating principle is that the goal reaction gets conditioned 
the most strongly to the stimuli preceding it, and the other reactions of the be- 
havior ^quenee get conditioned to their stimuli progressively weaker as they are 
more remote (in time or space) from the goal reaction. This principle is clearly 
that of a graient, and the gradient is evidently somehow related to the goal 
We shall sceordingjy call it the goal gradient hypothesis.” (6, pp. 25-26.) 

In order to perform these deductions it was necessary to postulate the mathe- 
characteristics of the gradient of habit strength which both Thorndike 
aid; Washlmira had sp^ified in a general way. Largely because of the results of 
Ycribioka's InveBtigation (17), habit strength was postulated as being a function 
of k^uithm of the amount of time (0 or space separating the receptor- 
injunction from the reinforcing state of affairs, i.e., 

sHb = a — 6 log h 

In lOT, however, the Ic^arithmic gradient was rejected on the grounds (1) that 
when I « 0, sHb becomes infinite, and (2) that with large values of t, sHr becomes 
lx>th of which ^m a priori improbable (7, p. 273). Accordingly the 
logarithmic equation was replaced by an exponential equation. In terms of our 
pg'^at notation this is: 

(13) 

where M' is l€^jming asymptote under a given reinforcing agent, t has the 
mm& sigaificMLce as in the logarithmic equation, and m' is the learning asymptote 
with a given <klay in reinforcement and reinforcing agent. The exponential 
^nation, becau^ of the excellent agreement of its implications with a consider- 
able range of empirical findings, is regarded as the closest approximation to the 
gml gradkat function at pr^nt available. 

The Wolfe Data 

Fortunatdj^, Wolfe (16) published the per cent of correct runs for each of 
Mi ei^t pmi|M rats at ^ch of the ten days of the training process. An exami- 
of data suggested that the scores of days 7, 8, 9, and 10 are the most 
and agnifimat of the scri^ for our present purpc^es. Accordingly th^ 
scores for the four days were pooled for each group of 

^xt stqp was to convert th^e probability-of-success scores into units of 
cm wsm Hn^r ^ale. Ihis was done on the assumption (see Chapter 
XIII) that manifests itself in this experiment only by overriding an 

teackmey varying from moment to moment, the net resxilt of which, 
» average, m to pve abmit equal numbers of choices to the right and the left 
l^s daMKse collation factor is aj^umed further to distribute itself 
acMitiing k) the Caui^an or “normal” law of chance. Con- 
w^^mt d tibis fiUKtion have been derived by mathematicians, whereby 
per erf may be at onc^ onverted into amounts in presumably 

in is usiMly flie standard deviation (tr) of the 

d vaiiiMhty involved. A simplified table of this kind 
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is shown as Table 9 on page 311. However, a more elaborate table was used 
with the present data. The use of the table may be illustrated as follows : the 
pooled scores of days 7, 8, 9, and 10 of the one-minute group show 64.9 per cent 
of correct reaction. Eeferring to our table we find that .this corresponds to a shift 
.40 (T from the pure chance (50-^) distribution of choices in the correct direction. 
In this way were obtained the following <r-values which may be regarded as 


indict of habit strength : 

Amouni of delay 

0 *^ 

5 ' 

30' 

1 ' 

2.5' 

5' 

10 ' 

20 ' 

Figure dO is {dotted from these values, 
the first four of the values is: 


I Tide jp of habit strength at 
trials 7 , < 9 , 9 ^ and 10 
1.21 <r 
.98<r 
.49 0- 
.40 0- 
.42 0* 

.28 (T 

.30 (T 
.18 cr 

The negative growth function fitted to 


m' = .375 4* (1.21 .375)e-*06«‘, 

Ihe smooth curved line of figure 31 is plotted from this equation. 

Some indirect confirmation of the hypothesis that more than one factor is 
operating to produce Wolfe's empirical gradient is furnished by the fact that it 
was found possible to get a rather satisfactory fit to the above set of values by 
assuming that they represent the simple summation of two growth functions, a 
major one corresponding roughly to the equation given above, and a minor one 
representing a rather slow learning process with a gentler slope. The complete 
equation, including both of the supposed components, is: 

771' = .175 + .225 ^ .gio *. 


The Characteristics of Perm’s Empirical Gradient 

The curve 'which has been fitted to Perin's empirical gradient of reinforcement 
is a diminishing exponential function which has as its asymptote, not a horizontal 
straight line as have ordinary exponential functions, but a straight line which slopes 
downward in such a way as to cross the horizontal axis. This equation, shown 
graphically in Figure 32, is: 

N ^ 40 

tan S AsHr = 1.6 X lO’-is * - .043 f -f 1.45, 

iV *60 

i\r = 40 

in which tan S AsHr represents the tangent of the curve of feaming at ^ 

=60 

point of 50 per cent choice of the act which remo'^es ^ manij^Ialive 
it will be notic^ is by no means the same as habit str^agt^ (aSW* Mofeow, 
is reason to believe that appreciable amounts extinction war 
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tribute to this fuaction wKich. may account for the fact that extrapolation where 
I > 34 yields negative valu^ 

Bc^ the Delay-of-Reinforcement Gradient Extend Forward as WeU as 
Backvi’ard from the Point of Reinforcement? 

Thorndike and his associates have reported evidence which suggests that 
the gradient of reinforcement may extend also in the forward direction; i.e., 
that K^rritfi g imy be reinforced when the reinforcing state of affairs precedes 
tte S — i? conjunction as well as when it follows it. Recently Jenkins (8) has 
mnhrmed th^ r^ilts in a convincing manner, using albino rats as subjects 
in a situation. These studies indicate that the maximum of the forward 
gradient m wnsderably lower than that of thej^ackward gradient. The adaptive 
^nificance of this se<x>ml gradient has not as yet been very carefully studied. 
Howwer, the reduction in a need necessarily follows rather than precedes 

act which brings this about, it would seem that the forward gradient could 
hardly play much rdle in sel^tive learning. 

Yoshioka's Study of Path-Length Discrimination in Rats 

A number of years ago, Yoshioka (17) investigated experimentally the relative 
difficulty of setting up a consistent preference for the shorter of two paths to a 
fixsd In ^neral he found that the ease of setting up the short-path prefer- 
aice was appmximatdy the same for two mazes, one of which was twice as large 
m the othar, for five different ratios of long to short path in each maze. 

But how can Yc^hioka's results be reconciled with the gradient taken as an 
aqpCHiential functioii of the form, 

m' = 

We have already shown above (p. 156) that if two pairs of delays in reinforcement, 
mx |air twice as icmg as tiie other, are chosen at certain points at each side of a 
CBitol pdnl d numm u m discriminability, one pair will be approximately as 
emf to kam as ti^ other. It is evident that there may be found a very large 
of aich jmirs of alternative delays. It is possible that Yoshioka chanced 
t© a ^ies of p^lrs of alternate pathways involving just such delays in 
rssfcir^menl 

Eqsatwos Frooi Which the Various Theoretical Curves and Tables Have 

Been Derived 

T1^ cw i^ative growth hinction of the goal gradient from which 

Tal^ 2 l^pue 33 (ferived is, 

M m maximu m sfaei^jth attainable with an unlimited number of rein- 
with fcg reinforcing agent employed. 

T^ from wMah Talde 3 and tlm curv^ of Figure 34 were derived is, 

jsEi =. - ot' X 10~-02225W 

w Mxs dM&ied tibe ptec^ecKng ^luatioiL 
, P * ^ derivisag ^ pcx cent of choices of the act associated with the 

water ad^ of niofaceRBeait is first to escalate the strength of the respective 
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habits {Hi and Et) by means of equations 1 and 2 ; then substitute these values 
in equation 15 : 


Et-H. 


(15) 


where o-i is the standard deviation of the oscillation of Ei, and is the standard 
deviation of the mediation of E^. Substitute the values of Ei, E 2 , <ti, and <r 2 
in this equation, solve for Xj and look up the value of p, probability of occurrence 
or per cent of dominance of the stronger habit, in a table of the probability integral. 

Example: Let us take the habit strengths yielded by delays of 30 and 60 
seconds at 40 reinforcements (Table 3, columns 2 and 3). These turn out to be 
43.8 hal^ and 27.5 habs respectively. Also it will be recalled that the oscillation 
range of both habits was assumed to have a standard deviation of 13 habs. Sub- 
stituting these values in equation 3, we have, 

43.8 ~ 27.5 

® “ Vis* + 132 

_ ^ 6.3 
x = ^ . 

Looking <m page 76 of Keliey^s Statistical Tables at a; = .886, it is found that 
con^ponding to this Value is a p value of .812 ; f.c., the stronger of the two habits 
will dominate in the long run 81.2 per cent of the test trials, exactly as is shown 
at the bottom of column 5. 
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CHAPTER XI 


Habit Strength as a Function of the Temporal Relation 
of the Conditioned Stimulus to the Reacdon 

Each of the last three chapters has been concerned with the 
quantitative aspects of one of the antecedent conditions of rein- 
forcement which determine habit strength. In this, the fourth chap- 
ter on this general subject, we shall consider the functional de- 
pendence of habit strength upon a pair of closely related antecedent 
conditions. These have already been laid down (p. 71) as the 
necessary qualitative conditions for learning, namely, that there 
must be a temporal contiguity between an effector activity and (1) 
an afferent impulse or (2) the perseverative trace of such an im- 
pulse. We shall begin with the consideration of the former. 

H.iBIT STRENGTH .‘IS K FUNCTION OF THE DURATION OF 
THE CONDITIONED STIMULUS AT THE TIME 
OF REACTION OCCUTtRENCE 

The question of the rate of habit formation as a function of 
the time the conditioned stimulus (S) has been acting when the 
reaction (i?) occurs has been submitted to systematic study by 
Kappauf and Schlosberg (5). These investigators delivered the 
unconditioned stimulus, in the form of a 1/3-second electric shock, 
to the right front leg of each of a series of albino rats. The con- 
ditioned stimulus was a loud buzzer which temporally overlapped 
the shock. With different groups of animals the buzzer be^BS* 1 /3, 
2/3, 1, 2, 4, and 7 seconds before the shock, both stimuli terminat- 
ing at the same time. Accordingly the habits thus set up would, 
in Pavlov’s terminology' {6, p. 88), be called “delayed” conditioned 
reflexes, as contrasted with “trace” conditioned reflexes (6, p. 40) 
in which the action of the conditioned stimulus terminates before 
the onset of the unconditioned stimulus. 

Kappauf and Schlosberg recorded and measured several differ- 
ent responses to shock which it was thought might become condi- 
tioned. Possibly because of the small number of animals employed 
in each group and the fact that each group was subdivided by 
differential treatment, the various reactions yielded somewhat dis- 
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cordant results. 1^6 single response which gave the most consistent 
conditioned reactions was a sharp inspiration or gasp. By pooling 
measurements taken from published graphs representing the scores 
of the two animals making up each of the six delay groups, there 
has been obtained an indication of what purports to be the func- 
tional relationship which we are seeking. The values so secured 
are represented by the circles of Figure 38. 



Fig. Graphic representation of the per cent of conditioned stimuli 
e’rokiEg antedaiiag reactions in a delayed conditioned reflex, as a function of 
the time mten-al from the beginning of the conditioned stimulus to the onset 
^ th^ »fcoaditioii^ stimulus. (Plotted from pooled measures of two graphs 
^ ^nt of oEmditioiied ^L^ing reactions in rats, published by 
KMpjmmi aM 5.) 

It m at mm eridmt from an inspection of the arrangement of 
ttee circle that long-delayed conditioned stimuli are much less 
tSmtire in acquiring receptor-effector connections under reinforce- 
^at conditions than are those of relatively short delay. More- 
over, l^giiiaiiig at the delay of maximiim efficiency there appears to 
bt m falling off in the per cent of stimulations evoking 

the decline taking place approximately ac- 
im a raaapie s«^tive growth or decay function of the delay 
Such a function was fitted to the valu^ 
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corresponding to the circles, and is represented by the curve drawn 
among them. The as^^mptote or ultimate level of fall turns out 
to be 30.4 per cent. This suggests that generalized response sensiti- 
zation due to shock was occurring in this experiment, or that an 
auditor^' stimulus will not wholly lose its capacit}" for becoming 
conditioned to a reaction however much it may be prolonged, or 
both. 

Turning to the left-hand side of Figure 38, there is found a 
definite suggestion that learning is facilitated by having the uncon- 
ditioned stimulus occur a fraction of a second later than the con- 
ditioned stimulus. The Kappauf-Schlosberg experimental technique 
mak^ difficult the distinction between conditioned and uncondi- 
tioned reactions in tJhis region; therefore the discussion of this point 
will be taken up in connection with an investigation presently to 
be considered, which does not suffer from such a handicap. 

From the preceding analysis, then, we conclude that within the 
limits of the Kappauf-Schlosberg investigation: 

1 . The maximum efficiency of conditioning occurs when the onset of 
the unconditioned stimulus follows that of the conditioned stimulus by a 
fraction of a second. 

2 . As the delay in the onset of the unconditioned stimulus increases 
beyond that yielding the maximum learning efficiency, the rate of habit- 
strength acquisition decreases progressively according to a simple decay 
function of the amount of this additional delay. 

3 . The value of this function at the limit of its fall is about a third 
of that at its highest point. This asymptotic value presumably ap- 
proximate the status in learning situations of static, i.e., non-changing, 
stimulus elements. 


A KEXJROLOGICAIi HTPOTHEBIS AN1> SOME COEOLLiARTES 

The r^ults shown in Figure 38 present such a striking parallel 
to Figure 4 and 5, particularly to Figure 4, that it has been con- 
sidered worth while to give brief consideration to some implications 
of a related neurological hypothesis which has been suggested by 
Kappauf and Sehlosberg (5, p. 39). The relevant neurological 
phenomena may be summarized briefly as follows: 

1. Receptor dischaige impulses b^in an appr^iable interval after the 
impact of the stimulus energy on the receptor ( 1 , p. 116; f). 

2. As the energy impact on the receptor becomes weaker, the discharge 
latency become longer (see Figure 3). 
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3. FollowiBg the period of receptor-discharge latency there is a period 
of relatively rapid recruitment in the frequency of receptor discharge 
impulses, which usually reach their maximum within a second {1, p. 116). 

4. The amount of'^stimulus energy ultimately applied remaining con- 
stant, the faster its rate of application, the more rapid will be the rate 
of reemitment and the greater the maximum frequency of receptor-dis- 
charge impulse (i, p. 75). 

5^ If the rate of stimulus impact is relatively abrupt and constant, 
the greater the stimulus energy applied, the greater will be the maximum 
frequency of receptor impulse discharge (1, p. 116), 

6. Following the attainment of the maximum frequency of receptor 
discharge impulse, the stimulus meanwhile continuing to act unchanged, 
there ensu^ a progre^ive decline in frequency approximately according 
to a simple decay function. In some receptors, such as touch and those 
a^N^kted with hairs, the frequency quickly falls to zero; in others, such 
m pr^ure and those a^ciated with muscle spindle, the frequency be- 
mmm constant at a level wed above zero (i, p. 79). 

7, In case the impact of the stimulus eneigy on the receptor organ 
ceases before the point of maximum receptor-discharge frequency is 
leached, there is usualy a brief after-discharge which apparently may be 
prolonged under certain circumstances by self-propagating central pro- 

W), This latter peiseverative activity presumably declines as a 
simple negative growth function of the time since stimulus termination, 
tlie asymptote of this decline being zero (Figure 5, p. 43). 

We now add to this summary of empirical findings a formalized 
statanait of Kappauf and Schlosbergk hypothesis^; Other things 
cgiinl, the increment to the strength of a receptor-effector connec- 
tim i A»Hr) resulting from a reinforcement is an increasing func- 
tion of the frequmcy of the associated receptor discharge^ or the 
mtemity of the resulting afferent impulse, 

Om immediate concern here is with the implications of the 
Kappauf-Sehlosberg hy^pothesis and certain items of the receptor 
hnpiii^ smmary pre^nted above, namely, items 3, 4, 5, 6, and 7. 

L It follows from the above hypothesis and empirical item 3 
that m a rmnforcement situation there is a temporal relationship 
of ike acmAfioiied to the imconditioned stimulus such that as the 
of the tmmnditioned stimidus is progressively delayed, the 
mte of harming wiU marease, 

IL It follows from the hypothesis and empirical item 4, other 
ranaining constant, that in a r Enforcement Etuation the 

^ are in no way to be held accountable for the 

^ ihk fcMiuktMm; the author assumes entire r^on- 
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slmver the rate of application of the conditioned stimulm energy, 
the slower will be the rate of habit-strength acquisition, 

III. It follows from the hypothesis and empirical item 5 that 
in a reinforcement situation, temporal relationships of conditioned 
and unconditioned stimuli remaining optimal, within moderate 
ranges the greater the conditioned stimulus energy applied, the 
more rapid will be the rate of habit-strength acquisition, 

IV. It follows from the hj-pothesis and empirical item 6 that 
in a reinforcement situation as the time of the onset of the uncon- 
ditioned stimulus is further retarded beyond the point of optimal 
timing, the rate of learning will decline, but at a rate slower than 
the rate of rise during the recruitment period, the course of the 
decline following a negative growth function of the amount of 
delay, with asymptote appreciably above zero in the case of cer- 
t<xm receptors, 

V. It follows from the hypothesis and empirical item 7 that 
in a reinforcement situation as the time of the onset of the uncon- 
ditioned stimulus is retarded beyond the optimal amount, the action 
of the conditioned stimulus having ceased before the maximum rate 
of receptor discharge impulse is reached, trace conditioned reactions 
will be generated. In such eases the rate of learning w’ill decline 
according to a simple negative growth function of the amount of 
delayj with its asymptote at zero. 

We may now briefly compare the above deductions with the 
facts of habit-strength acquisition. Corollary III is in agreement 
with the empirical obser\'ations as reported by Pavlov (6 ) . As yet 
no evidence has been found concerning Corollary II. Corollaries 
I and IV are in good qualitative agreement with the empirical 
results of Kappauf and Schlosberg. At the best the above deduc- 
tions may constitute the beginning of a passage of the molar theory 
of behavior over into the ultimate molecular behavior theory based 
on neurophysiologjv; at the worst they may be no more than a 
harmless failure in the long trial-and-error history w^hich must 
precede the evolution of a true molecular theory of learned be- 
havior. 

HABIT SIBENGTH AS A FtJNCTIOK OF THE DURATIOI<r OF THE 
STIMULUS TEIACB AT THE TIME OF ACHON OCCUEBENCE 

The fifth corollary derived from the Kappauf and Schlosberg 
hyfH)th^is, since it concerns the rate of learning as a function of 
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the age of the stimulus trace which is contiguous with the reaction 
to W associated in the learning process, leads directly to the sec- 
ond factor whose relation to rate of habit-strength acquisition is 
to be considered in the present chapter. A study bearing directly 
on this point has been published by Helen Morrill Wolfle. 

In one of her two experiments in this field (12, IS ) , Mrs. Wolfle 
employed M human subjects in groups of ten, each group devoted 
to the" determination of the effectiveness of trace conditioned-reflex 
jyamirig with B particular temporal relationship of the conditioned 



Fifl. W. GrapMc repre^Btatioa of habit strength as a function of the 
tfniperml relation of tbe conditioned to the unconditioned stimulus in a 
timee conditioned inaction. Both curves are simple negative growth 
whi^ asymptotes are 6^. (From data published by Helen Morrill 

Wolltj if,) 

|0 Iht iHiecajditicHied stimulus. The former was a single sharp 
cliek^ ih% lattoj an electric shock to the hand. The reactions 
word^ were hand-withdrawal movements associated with shock 
tTOidance. At irregular intervals throughout the reinforcement 
the conditioned stimulus was presented without the accom- 
p^ving ehc^k. The measure of learning was the per cent of hand 
following th^ presentations. 

n^lts of tills inv^tigation are shown by the series of 
m Fipire A glance at these circles suggests a confirma- 
ti« ^ Omilary I, the maximum efficiency appearing 'when the 
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shock followed the click by a fraction of a second. This we shall 
call the point of optimal stimulm asynchronism. In this the trace 
conditioned reaction of Mrs. Wolfle agrees substantially with the 
delayed conditioned reaction of Kappauf and Schlosberg. 

Coroliaiy V is concerned with the hollow circles, which repre- 
sent learning efficiency when the shock, and so the reaction and 
its reinforcement, followed the click by a half second or more. An 
examination of this portion of Figure 39 reveals that as the onset 
of the shock is retarded beyond the point of optimal stimulus 
asynchronism the habit strength diminishes consistently, the rate 
of diminution decreasing as zero efficiency is approached, quite in 
agreement with the corollary. The progressive decline in habit 
strength on idiis side of the point of optimal learning efficiency 
we shall call the posterior stimulus asynchronism gradient 

In order to test the corollary in more detail a simple negative 
growth function was fitted to these latter valu^; this is repre- 
sented by the broken line passing among the hollow circles. Nob- 
'withstanding the usual de'viations, the circles fall fairly cloi^ to the 
line, which indicates a reasonably good fit. The limit of fall of 
this function turns out to be 6.5, a value appreciably above zero. 
In this respect empirical results appear to disagree with Corollary 
V ; however, the failure of this stimulus-asynchronism gradient to 
fall to zero may be due to sensitization effects, i.e., the effects of 
the shock alone quite apart from its association with the condi- 
tioned stimulus (4f p. 431). 

THE PEOBIiEM OF BACKWAED TOOTmONING 

^'Backward” conditioning is said to take place where the stim- 
ulus originally evoking the reaction, and usually the reaction itself, 
occur before the impact of the conditioned stimulus. This order of 
occurrence during the reinforcement process is called backward 
because it is the reverse of the order of occurrence when the 
acquired receptor-effector connection functions; in the latter situa- 
tion, of course, the stimulus must precede the reaction it evok^. 

At one time Pavlov (d, p. 27) regarded backward conditioning 
as impossible; later he refused this opinion (7, p. 381), holding that 
while backward conditioning is possible, the results obtainable by 
this procedure are very weak and unstable. Upon the “whole the 
latter view has been substantiated by more recent studies (9)f 
including two experiments by Mrs. Wolfle (IB^ 13). 
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The two extreme left-hand circles of Figures 39 represent the 
Ttmlis attained by backward conditioning. In this connection it 
may be noted that these two circles occupy a position on a smooth 
and consistent gradient descending from a point near that of maxi- 
mum learaing efficiency. We shall call this the anterior stimulus- 
m^nchmnkm gradient. This continuity suggests that so-called 
conditioning may be physiologically no more than a 
special and extreme case of a gradient antedating not the onset of 
the conditioned stimulus but, rather, the optimal phase of stimulus 
asTBchronism, In order to test the hypothesis a simple negative 
g^wih function w^as fitted to the four values represented by the 
four ^lid circle. This is shown graphically by the smooth curve 
drawn through these circles; the fit may be seen to be nearly per- 
fet Whether or not it 1^ a coincidence, the asymptote of this 
gradient is exactly the same as that of the values at the right, 
namely, 6.5 cent. As in the case of the posterior stimulus 
asynehroni^ gradient, the small positive asymptotic value may 
very well be due to sensitization effects (4, p. 831), 

As a final word concerning the data represented in Figure 39 
it may be added that the two gradients were extrapolated upward 
to where they intersect, on the a^umption that the point of inter- 
^tion would indicate indirectly somewffiat more precisely the 
optimal temporal relationship of the conditioned to the uncondi- 
tjoned stimulus. The outcome of this manoeuvre is shown graphi- 
cally in Figure 39. It suggests that under the conditions of Mrs. 
Wolfles experiment the conditioned stimulus should antedate the 
on^t of the unconditioned stimulus by .44 second if maximum 
laming efficiency is to be attained. It also suggests that had a 
group of mibjecto been conditioned at this interval the learning 
yield would have been 42.4 per cent of reaction evocations, an 
advantage over tiie value obtained at a delay of .5 

^ond. 

The r^ults of the above analysis are in agreement with a con- 
sdeimble bcMly of experimental evidence which indicates that for 
optimal learning to take place the conditioned stimulus should 
the on^t of the unconditioned stimulus by something less 
a ^If ^ccmd. A certain amount of speculation has arisen 
^^ding lie cau^ of tods now well-established fact. One of the 
of ti^ hy|K>th^^ has been put forward by Guth- 
rie iM 3 who ii^ K:^g^ted that in the learning situation the stim- 
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ulus actually conditioned to the reaction is the proprioceptive stimu- 
lation arising from reactions, usually implicit, evoked by the 
conditioned stimulus at the outset of the learning. This obviously 
ad hoc hypothesis accounts for the optimal time relationship of 
the onset of conditioned and unconditioned stimulation, though it 
fails to explain why proprioceptive stimuli should have a monopoly 
in the acquisition of receptor-effector connections over discharges 
from other important receptors such as the ear and the eye. It 
seems likely that the solution of the problem must await the future 
developments of neurophysiology'; fortunately this is not a neces- 
sary prerequisite to the development of a molar system of behavior 
theory- 


''SBiOWlf' VERSUS TRACE COISTDITIONED REACTIONS 

Intimately related to the gradient of habit strength as a func- 
tion of the age of the trace at the time of reaction occurrence is 
Pavlovs distinction between short and long trace conditioned re- 
flexes. In this connection Pavlov remarks (d, p. 40) : 

Trace reflexes may be of different character, depending on the length 
of pause between the termination of the conditioned stimulus and the 
appearance of the unconditioned stimulus. When the pause is short, 
l^ing a matter of only a few seconds, then the trace left by the condi- 
tioned stimulus is still fresh, and the reflex is what we may term a short- 
trace reflex. On the other hand, if a considerable interval, one minute or 
more, is allowed to elapse between the termination of the conditioned and 
the beginning of the unconditioned stimulus we have a long-trace reflex. 
. . , every^ stimulus must leave a trace on the nervous system for a 
greater or less time — fact which has long b^n recc^nk^ in physioic^ 
under the name of after effect. 

The referent of the term trace in the last sentence quoted above 
is evidently the same as that of the expression perseverative stim- 
ulm trace in the present work. \Ye thus arrive at the identification 
of ^Irs. Wolfle’s right-hand gradient (Figure 39) with Pavlov's 
^‘sliort" trace conditioned reflex. 

Pavlov is less specific about the basis for his ^flong'^ trace 
conditioned reflexes, though his experimental examples and asso- 
ciated remarks make the picture fairly clear (6, pp. 41-42) : 

This may be illustrated by the following detafled experiment of Dr. 
Feokritova: A dog is placed in a stand and given food regularly^ every 
thirtieth minute. In the control experiments any-one feeding after the 
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first few Is omitted, and it is found that despite the omission a secretion 
m’ith a eorr^pooding alixneniarv" motor reaction is produced at 
the thirtieth minute. Sometimes this reaction occurs exactly at the 
thiitith minute, but it may be one or two minutes late. In the interval 
there k not the least sign of any alimentary reaction, especially if the 
rmitme has 1:^x1 repeated a good number of times. When we come to 
Imk an interpreiation of these results, it seems pretty evident that the 
duration of time has acquired the properties of a conditioned stimulus. 

Wlrnt k the physiological mining of these time intervals in their role 
as conditioEed stimuli? , . . Time is measured from a general point of view 
by r^-stenng different cyclic phenomena in nature, such for instance as 
the and of the sun or the vibration of the pendulum of a clock. 

But many cyclic phenomena take place inside the animal^s body. 


In a word, the explanation of the striking outcome of Dr. 
Fec^ritova's experiment seems to be that each feeding of the dog 
initiate the stable intemai cycle of digestion which activated 
somewhat different receptoi^ at each of its phases. The receptor 
dis charges releteed by the phase reached after 30 minutes of 
digestion w'ere naturally conditioned to the salivary process evoked 
and reinforced by each subsequent feeding. The afferent process 
conditioned under such circumstances vrould not be a perseverative 
stimulus trace but, rather, the afferent impulse arising directly 
from receptor discharges. For this reason considerable confusion 
mi^t be avoided if reactions conditioned directly to stimuli aris- 
ing from such phy^ological cycles were called cyclic-phase condi- 
tioned reactiom. 

A eyclic-phase conditioned reaction involving another type of 
pbysiolo^cal C3"c!e may easily be set up (4, pp. 417-418). An 
el^tric shock may be delivered to a subject 30 times at regular 
intervals of a half minute. Each shock will cause the subject to 
i^et rather strongly, releasing various endocrine secretions and 
othemm up^tting his equilibrium. Presumably the body at once 
will to shift back to normal in much the same way after 

each Pr^umably also, each phase of this recovery cycle 

will activate a somewhat different set of receptors. Thus just 
telore the onset and cessation of each shock the same set of recep- 
toim will dischai^g as on the previous occasions and so will 
wmdition^ to the reactions evoked by the shock, e.g., the 
Ain motion. It naturally follows that if the shock is 
drohar^ of the^ receptors will evoke the reaction at 
iim imal of inoidence, much as if the shock were deliv- 
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ered. A record of such a cyclic-phase conditioned reaction is repro- 
duced as Figure 40.^ 

Let it be supposed that a stimulus incidentally becomes con- 
ditioned to a strong reaction, such as the one to shock, which in- 
volves a certain time for the return of the body to equilibrium. 



Pig. 40. Reproduction of a record showing a cyclic-phase eoBditioned 
reaction in a human subject. The tracing shows the reactions to the la^ 
four of ^ induction shocks delivered at 38.5-seeond intervals. Presumptive 
conditioned galvanic reactions to the temporal inten'al appear at X, Y, and Z. 
The vertical lines have been drawn to show points of simultaneity on the 
several tracing, (Reproduced from 4 P- 418.) 

Such a stimulus will necessarily evoke the reaction during the 
learning process, thereby initiating an internal beha\ior cycle 
which may be some minutes in length. A conditioned stimulus 
combination of this nature would yield a conditionable process 
which would extend far beyond the range of a true perseverative 

^Und^ the author^s direction, Mr. R. 0. Roui^ performed a modified 
form of this experiment, in which a i^ock was delivered every 30 ^conds. 
Seven of eleven subject displayed temporal conditioning in one form or 
another analogous to that shown in F%ure 40, some giving evidence of anti- 
cipatory or anxiety tendencies. One subject who gave clear indications of 
^^temporal” conditioning admitted, upon sut^quent questioning, that he 
had counted; none of the other objects reported having done this. 
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receptor discharge, and might easily give rise to what would super- 
ficially appear to be trace conditioned reactions in which the 
stimuli would be separated by many times three seconds. This 
presumptive mechanism may explain the paradox that certain 
studies, such as those of Warner (11) and Yarbrough (14), purport 
to show reactions conditioned to perseverative stimulus traces in 
which the supposed ^‘trace’^ at the point conditioned may be as 
old as twenty seconds. 


SUMMARY 

Numerous experiments have shown that, the gradient of rein- 
fortement remaining constant, the most favorable temporal ar- 
rangement for the delivery of the conditioned and the uncondi- 
tioned stimuli is to have the latter follow the former by something 
l^s than a half second. But as the asynchronism of the onset of 
the two stimuli deviates from this optimal relationship in either 
direction there is a falling off in the habit strength which will 
r^ult from a git^en quality and number of reinforcements, the 
rate of decline in each direction probably being a simple negative 
growth or decay function of the nature and extent of stimulus 
aeynchroni^. The situation where the onset of the unconditioned 
^imulus, as well as the reaction and its reinforcement, antedates 
the optimal relationship by more than a half second or so includes 
two special cases which are traditionally known as dmvltaneom 
eondiiioning and backward conditioning. These are believed to be 
portions of the same physiological continuum antedating the point 
of optimal stimulus asynchronism. This yields the anterior stim- 
ulus-^mchionism gradient. 

The ca^ where the unconditioned stimulus falls later than the 
optimal relationship has been studied somewhat more than the one 
in which it this point. This develops two posterior gradi- 

ents of stimuliis asynchronism, depending on whether the condi- 
tional gtimulus terminates early or continues on to overlap the 
unconditioned stimulus. In both events, habit strength declines 
mait^y as the delay in the onset of the unconditioned stimulus 
t^yond the optimum, the diminution being a simple decay 
filiation of the amount of delay beyond the optimal relationship. 
If the ^aditionM stimulus persists, the resulting habit is said to 
te a ifehiyadf cmditicmed reaction. The rate of fall in habit 
strmfth as a function of increased asynchronism is moderate, and 
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the limit of the decline probably has a value considerably above 
zero. This level is believed to represent the status of static ele- 
ments in stimulus complexes. 

In ease the conditioned stimulus is instantaneous the resulting 
habit is said to be a trace conditioned reaction, the rate of fall in 
habit strength is relatively great, and the limit of decline in the 
gradient when experimental artifacts are eliminated probably is 
zero. It is doubtful if true trace conditioned reflexes can be set 
up when the onset of the unconditioned stimulus follows the termi- 
nation of the conditioned stimulus by more than about three 
seconds. 

Both posterior-as^mchronism gradients are tentatively regarded 
physiologically as increasing functions of the magnitude or inten- 
sity of the temporally contiguous afferent discharges. These in 
their turn are believed to be increasing functions of the frequency 
of impulses given off by the receptors. The gradient of afferent 
discharge intensity in the case of delayed conditioned reactions is 
supposed to arise from the continuous action of the conditioned 
stimulus upon its receptor; that in the case of trace conditioned 
reactions is thought to be a mere perseveration or trace of the 
afferent action after the conditioned stimulus which originally 
initiated it has ceased to act on its receptor. In the case of the 
continued action of a stimulus energy- on a receptor, the s presum- 
ably consists of the afferent discharge arising in the receptor at a 
given instant, plus the perseverative traces arising from the stimu- 
lation during preceding instants. 

The cydw-phase conditioned reaction superficially resemble 
the true trace conditioned reaction. In such situations the ^^stimu- 
lation’’ involves, in addition to a mere receptor discharge, the 
setting in motion of a major physiological cycle such as that of 
digestion or the return to equilibrium after an electric shock. If 
the stimuli activated by a particular phase of such a cycle are 
regularly conditioned to the reaction in question, this reaction will 
later be evoked at that phase of the cycle, and consequently at a 
certain time following the onset of a stimulus associate with the 
initiation of the cycle. Stimuli evidently originating in the diges- 
tive cycle in do^ have evoked conditioned reactions as much as 
half an hour after the last preceding significant stimulation. The 
resuliB of such ^Temporal” conditioning have somewhat mislead- 
ingly been said to yield *flong” trace conditioned reactions. 

The considerations put forward in Chapters VI, VII, VIII, IX, 
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X, and the present one enable ns to formulate our fourth primary 

prineiplei law, or postulate: 

POSTULATE 4 

Whenever an effector activity (r R) and a receptor activity (S s) 
(Ktm In close temjK)ral contiguity (iCr), and this iCr is closely associated 
with the diminution of a need (G) or with a stimulus which has been 
dc^ely and consistently associated with the diminution of a need (G), 
there will resifft an increment to a tendency (AsHr) for that afferent 
on later occasions to evoke that' reaction. The increments from 
^Mxesdve reinforcenrents summate in a m a n ner which yields a com- 
Mncd habit strength (sSr) which is a simple positive growth function 
of the number of reinforcements (N). The upper limit (m) of this 
oufTc of learning is the product of (1) a positive growth function of the 
magnitmle of need reduction which is involved in primary, or which is 
asscHsated with secondary, reinforcement; (2) a negative function of the 
delay (t) in reinforcement; and (3) (a) a negative growth function of 
the degree of asynchronism (t') of S and R when both are of brief dura- 
tion, or (b), in ca^ the action of S is prolonged so as to overlap the 
l^^mning of E, a negative growth function of the dmration of the 
continuous acticm of S on the receptor when R begins. 

NOTES 

Mathematical Statement of Postulate 4 

stateMnt of Postulate 4 is distinctly more concise, con- 
and informatiire than is the verbal formulation, given above. It has 
mvcml cmm c^>ending on whether (1) S is prolonged and overlaps the beginning 
of R femporally, and (2) their d^ee of asynchronism in case they are of brief 
damtkHi and not overlap temporally. As an illustration of the case where R 
fifiows the continuous action of S on the receptor for duration £% 

'm have ttie fdl wii^ equation : 

sEm =» M (1 - (1 — (16) 

wtee, 

M *= IIH the physMc^eal maximum of habit strength; 
e » a amttematieal constant usually taken in the present work as 10; 

w « ft coaMast change in a measurable objective criterion which results in a 

reliction , 

I « tM d^y in rmif<^roenient; 

^ r* — f!i — wheiu S k of mcae tiiiau institntaneoiK duration and over- 

^ erf R; 

Tm * tii» of Ae <rf R; 
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T’^ = the time of the tx^ginrJeg of S; 
.V = the iiu.ml>er of reinforceniente : 
i*, ]\ % and i = empirical eonstanta. 


Trie meaning of equation 16 may 1:^ clarified by the following example. 
Lei it be supposed that we Imve a simple leiiming .situation in which ^ rein- 
forcemcruis are gi’/en hV = 20); tliat 5 grams of a standard food axe given a 
canine .subject at each reinforcement (w = 5); that the reioioreement is given 
3 seconds after the aC* H = 3} ; and that S liegias a continuous action on the 
receptor 2 f^eonds before the bc^ginning of R (1^ = 2). Taking the values of 
k, j) u, and i from previously given equations (11, 14, 20, and 6) fitted to empirical 
data, and sutetituting, we have, 

sHe = 1CK)(1 - ^ S)io-^c<«'2 K s X 10** -ur2-.6r.(i _ i 0 “’ 5 is 2C), (17) 

Solving by easy stages, we have, 

sHr = 100(1 - 5 ^ X 10475 X 2.1723 “ 10“ «■<“) 

= 1C«(.3562)(1 - 10--e«x»}) 

= 35.62(1 - 10-M8x»), 

which presents the familiar equation expr^ing the positive growth curve ot 
simple learning. Solving further, w’e have, 

sHs ~ 35.62 X .^35 
= 20.1 habs. 

In case the learning situation is the same as abo\"e except that both S and E 
are of brief duration, -S preceding R by .9 second, the equation becomes : 

sHs = 10(1 - 10-1^^5)10~.CX^2xS X 10-^x.182(.9--.44)(1 _ IQ-MZXW)^ (Ig) 

.V s^E — 12-5 habs. 

In case the learning situation is the same bs above except that R preo^^ S 
by one-tenth of a second, we have: 

sHb = 10(1 - 10--i53 X 5)10-.&0672 X 3 X 10^1-Q68C-.1-.44)(1 _ (19) 

sSs == 11-0 hate. 


The Equation of the Curve of Figure 3S 

The n^ative growth function fitt^ to the valu^ repre^nted by the hollow 
dreks of Figure 38 is; 

y = 62.6 X 10--25l6Cr~i) 4- ^.4 (^) 

where y is the pa: cent of antedating reactions and n is the time from the banning 
of the conditioned stimulus to that of the unconditioned stimulus, the reaction, 
and the reinforcement. The number 30,4 represents the a^mptote. 
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The Eqnationi of the Cur\'es of Figure 39 

The Eegative growth fiinetion fitted to the hollow circle at the right of Figure 

39 k: 

^ = ms X -f 6.5 (21) 

where p m the per cent or probability of conditioned-reaction evocation on the 
tet trto and x h the tim at which the reaction occurs (Tr) less the time at 
which the stimulus cx^cuis (Th); in case R precedes S, x will, of course, be nega- 
Hwe, The number 6.5 represents the asymptote. 

native grcwth function fitted to the values represented by the solid 
at ti^ Mi mdb of Figure 39 is: 

^ = 22-5 X + 6.5, (22) 

in which fj and the value 6.5 mean the same as in the equation fitted to the right- 
gr^dieiit, and the values of z must be less than .44 second. 

Prolmbly a K>iiwhat more general and significant manner of writing the 

above ^uations is as fdlows: 

p = 35.9 X 10-^ - 6.5, (23) 

is which, 

p = the per cent or prolmbility of conditioned-reaction evocation; 
r .44; 

fa = time of the instantaneous occurrence of R in seconds; 
fj = tile tiiM of the instantaneous occurrence of S in seconds; 
f - — I.06S if f is native, but -b 1.182 if is not negative. 

Ultimately of course, the valu^ of p in equations 20, 21, 22, and 23 will need to 
be ooiverled into units of amount on the basis of the normal probability fimction 
(tee p. 311 fi.), which will presumably change the equations in question as well as 
forn^ erf Figure 38 and 39. Despite this defect it is believed that these 
equations and %ures have a certain amount of expository value as well as sug- 
agnifi^nce for further developments. 

Stkiifiiis-A^Tiekronism Gradients and the Parameters of the Curve 
of Habit Acquisition 

T^ the rdation of the three stimulus-asynchronism gradients of 

conditioning to the slope and asymptote of the curve of habit-strength acquisition 
m a function of numb^ of reinforcements, arises here just as we have seen 
laimSel ar^ in connection with the amount of the reinforcing agent 

emi^oyed and gradient of reinforcement. It is clear that a satisfactory 
of toraiag and of behavior requires a knowledge of this relatioi^ 

•hip in al mentioned. Unfortunately no relevant evidence has b^n 

foinMi which k mSeient to yield a dedsive indication as to the relationship in 
^ Gi sfertulcm-asynchronism gradients. However, largely in order to 
bdcce i^ctents of behaview* principle, we pc^tulate that it is 
mm wv^al of tiie asymptote of habit strength (m) with un- 
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The Derivation of the Equation Expr^ng Pc^tulate 4 


It is assumed that. 



If' = M (1 - e-^). 

(12) 

either 


(14) 


m = 

(24) 

where (20) 

s 

1 

1 

11 


or (23) 

m s= 

(25) 

and 

11 

1 

0 

1 

(26) 


where M, M', m', and m are respectively: the absolute upper physiological limit 
of habit strength with unlimited reinforcement, the upper Mmit of habit strength 
as determined by the nature and amount of the reinforcii^ agent employed, the 
limit of habit strength as detennmed by the delay in reinforcement, and the limit 
of habit strei^h as determined by the d^pree of receptor-effector asynchronism. 

Now, ®ibstituting equation 24 (or 25) in equation we have, 

bHs = - 10-^D (27) 

Sufcetituting equation 14 in equation 27, we have, 

bEm = — 10”*“D (28) 

Substituting equation 12 in equation and recalling that M = 100 habs, w'a 
have, 

bBb = 100(1 - c-*»)e-’* e-^(l - 10“*’^0, 

which is one of the alternative equations sought (16). 

It is alt<^ther probaWe that tixare are other important factors which enter 
into the determination of habit strength in lesuning or eonditionii^ mtua^ 

tions. Two of these are the intensity of tire conditioned stimirius (5) and the 
vigor or intensity of ti^e r^wrtion (R). Sp^afic reswches analc^cwis to the studies 
of Gantt, Perin, Williams, Hovland, and others cited in the prec^ng |mges need 
to be performed on these probable factors before their relationship to habit 
strength can be postulated with much confidence. In the end a grand investiga- 
tion involving the finding of the joint reaction potentiality of aU the pr^umptivc 
determining factors when taken in all combinations of the repr^ntative values 
of each (after the manner of Penny's stiidy of habit streogth and motivation, 5) 
most be carried out before a really dependable equation for Postulate 4 can be 
written. The point is that habit strength as a function t whai w and f are 
at a (^rtain value, k nc^ nece^arily tire san^ as it will be wten to and f 
are coi^tant at other values. This will be a hu^ task, Imt the cnitcome shouM 
be woitia the labor invoIvaL It seems unlikdy that the Fisher-di^gn type of 
esp^EimKit win yidid dependaHe indications of the complex hyperspatial curva^ 
tares which almost certainly wili be found. M^nwfaile/ the equations giv^ 
above may ^rve as pcants of <]^mrture foe furth^ empiriml analyse. 



i82 


PRINCIPLES OF BEHAVIOR 


REFERENCES 

1. Amian, E. D. The bam oj senmiion. New York: W. W. Norton and 
‘ * Co,* 1^. 

2. Qmabam, C. H. Vision: III. Some neural correlations. Chapter 15 in 

Handbmk of general experimental psychology, C. Murchison, editor. 
Wor^ster, Mass.: Clark Univ. Press, 1934. 

3. GiTiiaiE, E. R. Conditioning as a principle of learning. Psychol, Rev., 

1930, S7, 412-4^. 

4. Hull, C. L. learning: II. The factor of the conditioned reflex. Chap- 

ter 9 in Handbook of general experimental psychology, C. Murchison, 
editor. Worcester, Ma^.: Clark Univ. Press, 1934. 

5. K.4FPAIJF, W. E., and Schlosbebg, H. Conditioned responses in the white 

rat. III. Conditioning as a function of the length of the period of 
delay, i. Genet. Psychol., 1937, 50, 27-45. 

6. PAfiiw, I. P. Conditioned reflexes (trans. by G. V. Anrep). London: 

Oxfc^ Univ. Pre^, 1927. 

7. P. 4 \ii>v, 1. P. Leciwres on conditioned reflexes (trans. by W. H. Gantt). 

New York: International Publishers, 1928. 

8- PsiN, C. T. ]^havior potentiality as a joint function of the amount 
of training and the degree of hunger at the time of extinction. J. 
Exper. Psychol, 1942, m, 93-113. 

9, Rc^ENBLtTETH, A. Central excitation and inhibition in reflex changes of 
hc^rt rate, Amer. J. Physiol., 1934, 107, 293-304. 

10. SwiTZM, S. A. Backward conditioning of the lid reflex. J. Exper. 

P-tychoU 19^, 15, 76-97. 

11. W.AENEE, L. H. The association span of the white rat. J. Genet. Psychol., 

19^, 57-90. 

12. WoLfLB, H. M. Time factors in conditioning finger-withdrawal. J, Gen 

Psychol., 19^, 4, 372-379. 

13. Wcaji.E, H. M. Conditioning as a function of the interval between the 

conditioned and the originai stimulus. J. Gen. Psychol., 1932, 7, 80-103. 

14. YmmwGn, J. U. The influence of the time interval upon the rate of 

kaming in the white rat. Psychol Monog., 1921, 50, No. 135. 



CHAPTER Xn 


Stimulus Generalization 

The preceding chapters have shown that learning takes place 
according to various principles of reinforcement. In giving this 
account we have followed the conventional practice of character- 
izing learning as the setting up of receptor-effector connections. 
Moreover, we have represented these connections by such symbols 

as sUb, $Ht, and S *s >r — *R, which specify only the 

receptor and effector processes actually involved in the reinforce- 
ment. It is now necessary to point out that while there is every 
reason to believe that each reinforcement does result in the con- 
nection represented by the sjunbolism, the actual outcome is much 
more complex than this. The fact is that every reinforcement 
mediates connections between a verj' great number of receptor and 
effector processes in addition to those involved in the reinforcement 
process and represented in the conventional symbolism bHr. Sev- 
eral groups of such additional and indirectly established receptor- 
effector connections may be distinguished: 

1. The reaction involved in the original conditioning become con- 
nected with a considerable zone of stimuli other than, but adjacent to, 
the stimulus conventionally involved in the ori^nal conditioning; this k 
called stimulus genercdization. 

2. The stimulus involved in the original conditionii^ become con- 
nected with a considerable zone of reactions other than, but related to, 
the reaction conventionally involved in the original reinforcement; this 
may be called response generalization} 

3. Stimuli not involved in the original reinforcement but lying in a 
zone related to it become connected with reactions not involved in the 
original reinforcement but lying in a zone related to it; this may be called 
stimulus-response generalization. 

The present chapter is particularly concerned with certain phe- 
nomena of stimulus generalization and some of their implications 
concerning adaptive behavior. 

^The present analysis indicates that response generalization is a rather 
complex secondary phenomenon; s^cs is not available in the present work 
for an adequate treatment of it. 
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paiMABY STIilCLfS-atTAIilTT GENEElAUZATION 


s, 

s 




Th^ molar principle of primary stimuliis generalization is now 
well established both qualitatively and quantitatively, mainly by 
conditioned-reaction experiments, A few of the indirectly acquired 

receptor-effector connections set up by 
S 3 the conditioning of So to R are repre- 

sented in Figure 41 as originating in the 
S’s with integral subscripts. Since all 
of these potentialities of reaction evoca- 
tion converge from different stimulus 
possibilities upon the same reaction, 
stimulus generalization may be said to 
generate a receptor-effector convergence. 

Under certain circumstances, e.g., in 
long trace conditioned reactions, gener- 
alization normally extends into receptor 
modes other than that involved in the 


R 




03. 


Fm. 41. Diagram of re- 
ceptor-eifeetor convergence 
ariaisg from the primaiy 
gtimuius generalization set 


up concurrently with the 
conditioning of sH^. Si, 

S% represent positions at 
progressively greater dis- 
tances on one of on 
a Gse-dimensional stimulus 
coatiainim, md -Ss', Ss' 
reprint eorr^onding po- 
sitions on the other side of 
Si on the ^me stimulus 
eontinuuiB. In this notation, 
represents the stimulus 
when eoBsidered as im'olved 
in the reiniorcement proc- 
ess, amd Si, 8$, etc., 
Mimuli when ^JE^dered 
evoking rmetmn, ^ and Sn 
mm undci^ocMl to fail at the 
mmm point on the stimuius 


reinforcement. Thus Pavlov reports the 
case of a defensive salivary reaction in 
a dog conditioned to the trace of a tac- 
tile stimulus, the reaction in question 
subsequently being evoked by a thermal 
stimulus of 0° centigrade (if, p. 113). 
It is important to note that while pri- 
mary stimulus generalization may pa^ 
the boundaries of a given sense mode, 
this is not usual. Stimuli conditioned to 
one sense mode will ordinarily generalize 
only to other stimuli in the same sense 
mode, e.g., from one auditory vibration 
rate or intensity to another, or from one 
light wave length or intensity to another. 


In general the more remote on the stim- 


iijiis continuum the evoking stimulus (S) is from that originally 
conciitioiied (Si , the weaker will be the reaction tendency mobilized 

by it 

Tlie quantitative law of primary stimulus generalization is 
Bt^ly by an experiment reported by Hovland (4 ) . This 

investigator wmdiliimttl the galvanic skin reaction in human sub- 
lets to a pine ixme, e.g., of 1,967 cycles per second, and then meas- 
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iired the amplitude of the reaction evok«i at this pitch and at three 
other pitches separated from each other by an equal number of 
discrimination thresholds (25 j.n.ddsf. The pooled results from 
twenty subjects are shown in Figure 42. There it may be seen 
that: 

1. The amplitude of galvanic skin reaction diminishes steadily* with the 
increase in the extent of deviation (d\ of the evocation stimulus (Sj from 
the siimuliis criginally conditioned iS). 



mfFDecES omm nm pom or iciNnHmHT fl)) 

Fig. 42. Empiricai generalkation gradient of conditioned galvanic skin 
raaction derived from data published by Hovland (^). Note that the gradient 
extends in lx)th directions on the stimulus continuum (vibration rate) from 
the point originally conditioned. 

2. This diminishing generalization tendency extaads symmetrk^y in 
lK)th directions along the stimulus dimension. 

3. The quantitative course of the diminution in the generalization 
tendency approximates mther clo^y a negative growth function of the 
amount (d) that S deviates from S as measured in discrimination thr^h- 
oids (jji.d/s). This is attested by the clc^n^ with which the smooth 

reprinting a simple decay function fitted to the generalization 
data, approximate the circle of Figure 42. 
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4. The asymptote or lower limit of the generalization gradient falls 
at 12.3 millimeters, rather than at zero. The failure of the gradient to 
approach zero as a limit is regarded as an experimental artifact due in 
part to the fact that previous to conditioning this reaction is evokable 
in arjureciable amounts by any stimulus of even moderate intensity, in 
pan* to sensitization, and in part to the reaction becoming conditioned 
somewhat to the static stimuli arising from the experimental environment 
The en\iroiiniental portion of the stimulus situation, of course, remains 
constant throughout the changes in the tonal stimulus, which alone pro- 
duce the gradient. Accordingly it is concluded that the asymptote of the 
true generalization gradient is probably zero (8). 


PBIMAET STTMULUS-INTEi^rSITY GENERALIZATION' 

In a second study (5) employing the same general apparatus 
arrangement, Hovland attempted to determine the quantitative 
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Fte. 43. Empirical stimulus-intensity generalization gradient of the con- 
tftioned galrame skin reaction plotted from data published by Hovland (5). 
Kote that while the gradients slope downward with increasing degrees of 
^•ration from the point originally conditioned, the steepness of the slope is 
mtiactly limii that of the stimulus-quality generalization shown in 

law of Btanulus-intensity gsieralization. The procedure was to 
a ©vrai inteisity of a simple sinusoidal sound waye to 
the ©avjaik skin reaction, and then test other intensities of the 
raim im^amcj at 50 j Ji.d. intervals. While the results of general- 
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ization were seriously complicated by specific effects of intensity 
wiiieh were quite independent of generalizationj it is believed that 
these latter effects were substantially eliminated by pooling the 
results of two groups of subjects, one in which the reaction ampli- 
tude was determined for intensities greater than that conditioned 
and one in which the determination was made for intensities which 
were less. The outecme of this experiment is shown by the circles 
in Figure 43. A negtitive growth function has been fitted to these 
%'alues; this is represented by the smooth curve running among the 
circles. The fit is reasonably good. 

It is evident from an inspection of Figure 43 that stimulus 
intensity also manifests a generalization gradient, but that the rate 
of fall of the gradient per j.n.d. of detdation from the point con- 
ditioned is distinctly less than is that for stimulus-quality' general- 
ization as sho^m in Figure 42. The latter has a fractional rate 
of decremental change per j.n.d. of deviation from the point con- 
ditioned of approximately 1/33, whereas the former has an F- value 
of approximately 1/77. 

THE CONCEPT OF EFFECTIVE HABIT STRENGTH {sHr) 

From the foregoing it is evident that the simple notion of 
habit strength, as indicating merely the strength of connection 
betw'een the stimulus and the reaction involved in the original 
reinforcement process, must be radically expanded before the in- 
fluence of learning on functional activity is to be understood and 
represented in a realistic manner. It is true that the various prin- 
ciples of reinforcement when perfected will presumably enable us 
to predict with precision the strength of the connection between 
the conditioned stimulus and the associated reaction. This is all 
right so far as it goes, but it represents only a small portion of 
the zone of reaction evocation potentialities set up by a given 
reinforcement. The strength of the connections at the other points 
of the zone can be determined only from a knowledge of the strength 
of the receptor-effector connection {bHb) at the point of reinforce- 
ment and the extent of the difference (d) between the iK>sition of 
the conditioned stimulus (S) and that of the evocation stimulus (S) 
on the stimulus continuum connecting them. Thus there emerge 
the concept of functional 0 T_eJfective habit strength, which we shall 
represent by the symbol sHr. This symbol will be used to desig- 
nate the habit stoength throughout the entire zone of habit forma- 
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tioa which is set up by a given reinforcement process, or, with modi- 
SeatioDS according to conditions), the summation of the 

effects of two or more reinforcement processes. The symbol 
will be reserved, as before, to indicate the strength of habit at the 
point of reinforcement^ i.e., when d = zero, 

sHjs = S^B» 

THE CONCEPTS OF STIMULUS DIMEISTSIOFT AND AFFERENT 
GENEElALIZATIO]Sr CONTINUUM 

It is clear from the now familiar causal relationship S — 

>R that in the evocation process (1) the stimulus 

energy (S) determines which receptor shall be activated and the 
weasion of its activation, (2) the nature of the receptor thus 
activated determines the detailed characteristics of the receptor 
discharge (a), and (3) the reaction (R) is only indirectly a func- 
tion of 5 by \irtue of the fact, and only to the extent, that a stands 
in a one-to-one relationship to S. There is reason to believe that 
this parallelism is never exact and that certain factors such as 
afferent neural interaction may produce marked deviations. 

Th^e considerations have definite implications for certain phe- 
nomena of stimulus generalization. Thus it is clear that there can 
be no primary stimulus generalization unless there is some parallel 
physical variability in the stimulus energy to serve as its basis; 
for example, there could be no generalization in the stimulus dimen- 
sion of frequency or amplitude of sound waves if sound waves did 
not pr^ent such dimensions of variability. Secondly, generaliza- 
tion cannot take place on a given stimulus dimension if the relevant 
re^ptor dc^s not respond differentially to variability in that dimen- 
®oa ; for example, organisms which are color blind, i.e., those whose 
raptors do not yield differential responses to variations in 
wave length of light, can hardly be expected to show a generaliza- 
tion gradi^t along this stimulus dimension. Accordingly there 
emei^ the contrasted concepts of stimidus dimension and afferent 
gmemlizatim continuum, the latter being the differential afferent 
jn^pon^ (si eorr^ponding in varying degrees to variation in a 
pvim stimulus continuum. With few exceptions the receptors of 
Miwial organisms ■ appear to yield afferent ■' generalizatioii 

for all the physical stimulus dimensions to which they 
r^Ksd at ail. Ckm'em^y, for ev^ empirically observed primary 
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generalization dimension, some measurable dimension of the physi- 
cal situation has usually been found. 

Since most stimulus energies %*ar%* in more than one dimension, 
it comes about tluit any given bit of learning is likely to set up 
generalization gradients along several continua simultaneously. In- 
asmuch as the relevant stimulus dimensions may vary^ independ- 
ently, the resulting mixture of several generalization gradients in a 
single generalization situation often greatly complicates the inter- 
pretation of experimenta! results involving generalization. This is 
particularly true in the field of vision, where there may be com- 
bined the stimulus dimension corresponding to white light and the 
innumerable afferent generalization continua arising from the 
simultaneous combinations of two or more wave lengths. To com- 
plicate matters still further there is the neural interaction (p. 42) 
of piw^es going on in different parts of the retina, wdiich makes 
the afferent discharge (I) of a given retinal element not merely a 
function of the stimulus energy (S) impinging on it but al^) of the 
energies *S', S'% etc., impinging on neighboring element at or about 
the same time. This complication becomes esj^ially apparent in 
figured or spatially patterned stimulus situations in 'which there 
may emerge such generalization continua as degree of cuiwature 
or angle of outline, size of figure, brightness, contrast or difference 
hetw'een portions of the area stimulated, the angle of rotation of 
the figure, and so on. When numerous physical dimensions are 
mixed in various ways and, particularly, w’here interaction occurs 
between different parts of the retina, the nature and amount of the 
generalization effects are extremely difficult to pr^ict, as the exten- 
sive experimental investigations of Lashley have shown (10). There 
is reason to hope, however, that these problems will finally yield 
to the joint and systematic study of primary generalization gradi- 
ents and gradients of afferent neural interaction. Certain investiga- 
tions of the Gestalt psychologists should prove valuable in clarify- 
ing the latter type of problem, 

IN* WHAT irNTTS SHALL THE DIFFEEENCE BETWEEN THE COHDI- 
TIOKED STIMULUS AND THE FVOCATIOH STIMULUS IN 
PRIMABY GENERALIZATION BE MEASURED? 

Discrimination mvestigations show with considerable clarity that 
while s is a function of S, the relationship is usually by no means 
linear, and fr^uently it is not even monotonic. A well-known 
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example of the latter type of irregularity concerns the octave in 
auditorj' wave frequency. There is a tendency for generalized re- 
aeiiona to evoked more strongly by stimuli which are even 
multiples of the frequency originally conditioned than by certain 
intennediate rates iP), a fact w-hich is in conflict with the prin- 
ciple represented by Figure 42. The present a priori unpredictability 
in the character of many receptor responses as a function of the 
several stimulus dimensions is presumably due to our ignorance 
regarding the physiolog;^’ of the receptors. Because of these irregu- 
larities it will probably be impossible to represent all generalization 
gradients m any uniform function of the stimulus dimensions in- 
Tolir^ 

There remains, however, the possibility of expressing general- 
ization gradients in terms of distances on the afferent generalization 
continuum. The natural imit of measurement on this continuum 
is the discrimination thr^hold, or J.n.d. This is a difference be- 
tween two stimuli on a given stimulus dimension (the other dimen- 
®ons remaining constant) such that at the limit of discrimination 
training the organism vrill consistently give differential reactions 
to the two stimuli on 75 per cent of the trials. Presumably because 
the process of discrimination involves the joint effect of variability 
in the stimulus dimension and the corresponding afferent reaction 
of the x^eptor, the generalization gradients in general appear to 
be rather accurately and simply expressible as negative growth 
functions of the stimulus dimension when transmuted into j.n.d. 
unita 

(^ViRAlUZATIOISr BY MEANS OP IDENTICAL STIMULUS 
COMPONENTS 

Th^ may now be mention^ a second form of stimulus gen- 
that which aris^ indirectly because conditioned stimuli 
are not simple but are normally compoimded of the simultaneous 
discharfe of a very great number of distinct receptors. Let it be 
for oample, that a salivary reaction has been condi- 
tion^ to a com|K>und stimulus consisting of a group of sound waves 
prcriucM by an or^n pip^ (iSu) and a group of light waves pro- 
di»dl by an filament (S*,), and that each of the two 

rtwBulis has independently acquired a superthreshold 

ptox^bty fcr evoldi^ the reaction. Now, if a second stimulus 
atoiatkii of tae vibrations produced by the organ pipe 
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(Sa l were presented either alone or in combination with a cutaneous 
vibration iSch say, the reaction would be evoked, owing to the 
presence in the second stimulus situation of the originalh" condi- 
tioned auditoiy^ component (Sa*. The operation of this principle 
has been investigated by the author at a gross molar level in con- 
nection with an experimental study of generalizing abstraction or 
concept formation ^ ; it has also been utilized extensively by 
Thorndike (iJi in the explanation of certain forms of training 
transfer. 

It is tempting to assume with Guthrie (B) that all primary 
generalization is built on this model. Involved in such an hypoth- 
esis there is the implicit assumption that the afferent discharge 
initiated by every stimulus energy consists of a large number of 
afferent “molecules/’ and that the contiguous receptor discharges 
on any given stimulus continuum have a considerable portion of 
their afferent molecules in common while differing with respect to 
certain others. Thus one receptor discharge might consist of the 
molecules a, 6, c, d, e, /, g, whereas the adjacent one on the afferent 
generalization continuum would consist of the molecules 6, c, dj e, 
/, g, h, the next one would consist of c, d, e, /, g, h, f, and so on. 
One discrimination threshold on such a continuum would be the 
physical measure of the qualitative or quantitative variation in the 
stimulus energt- which would change enough afferent molecules so 
that a subject, at the limit of training, would react differentially 
to the tw'o stimuli on 75 per cent of the trials. It is quite possible 
that something of this nature will turn out to be the ultimate 
physiological explanation of primary generalization. As yet, how- 
ever, proof is lacking on the molecular level, and there seems no 
immediate prospect of securing a critical test of the hypothesis. 
Meanwhile w^e must get along as best w-e can with a molar analysis 
based on empirically determined functional relationships, e.g., those 
presented graphically in Figures 42 and 43. 

SE<X)NDARY STIMULUS GENERALIZATION 

Except imder certain special circumstances, such as those of 
sensitization (7, p. 431) or “long” trace conditioned reactions, con- 
ditioned stimuli probably do not show generalization into other 
receptor mcKies. Yet we may recall toe name of a person wito 
alK>ut equal probability on seeing either his face or toe back of 
his head, at the sound of his voice or even Ms footstep. Such be- 
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hAvioT presumably comes about not through primary stimulus gen- 
eralization, but indirectly through each reaction being specifically 
learned. For example, a salivary reaction may be conditioned to 
a tactile \ibration, to an auditory vibration, and to a flash of light. 
Since each of the three stimuli leads indifferently to the salivary 
reaction, this type of habit organization constitutes a learned recep- 
tor-effector convergence, quite distinct from the convergence pro- 
duced by primary gen- 
eralization (Figure 41). 
Such a learned converg- 
ence is represented dia- 
grammatically in Figure 
44. 

Receptor-effector con- 
vergence is of particular 
importance in behavior 
theory, since it appears 
to be a medium of the 
automatic transfer of 
training effects. It is significant in the present context because, 
as a special case of such habit transfer, it seems to mediate what is 
known as secondary stimulus generalization. 

An apparent ca^ of secondary stimulus generalization has been 
reported by Shipley (li) and verified by Lumsdame (5, p. 230). 
These investigators presented a subject with a flash of light fol- 
low^ by the tap of a padded hammer against the cheek below the 
eye, thus conditioning lid closure to the light flash. Next, the same 
subject m|:^atedly given an electric shock on the finger. This 
evoked not only a sharp finger withdrawal from the electrode, but 
lid clc^ue m welL Finally the flash of light was delivered alone. 
It was fdind in a considerable proportion of the subjects of both 
Qi^riineiite that during this latter manoeuvre the light evoked 
fin^r redaction mm themgh the former had never been associated 
with the shxdc or ttw finger retraction. The interpretation 

is that the evoked the lid closure, and the proprioceptive 
rtlmiilatloii prcKiuc^ by this act (or some other less conspicuous 
at the same time) evoked the finger retraction, 
photographic records (S, p. 231) of the process 
to toe view toat in this esj^fiment the wink reaction 

m m toice they show that, typically, when 

evt^ed finger refametion toe lid closure usually took place 


TOUCH 


SOUND SALIVATION 


LIGHT 

Ficj. 44. Diagram of a specifically learned 
converfent eseitatory mechankm. Each stimu- 
lus is a^umed to have been conditioned to the 
olivary ruction on a ^parate occasion. 
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between the flash of the 
light and the finger 
movement. Oceasionaliy, 
however^ the two reac- 
tions occurred at the 
same time, and some- 
times the finger move- 
ment even preceded the 
blink. This, of course, 
could not have happened 
if the finger retraction 
was evoked by the pro- 
prioceptive stimuli aris- 
ing from the lid closure. 
It is possible, however, 
that numerous other re- 
actions were conditioned 
at the same time as the 
wink, and that proprio- 
cepthe stimuli from all 
of them became condi- 
tioned to finger retrac- 
tion. If oceasionaliy the 
lid closure should have 
occurred later than the 
other reactions, the pro- 
prioception from the lat- 
ter might easily have 
evoked the finger retrac- 
tion alone. While these 
considerations compli- 
cate the interpretation 
of Shipley’s results to 
the extent that they do 
not constitute an un- 
equivocal proof of the 
mechanism of secondary 
generalization, the ex- 
periments do demon- 
strate the existence of 
^ondary generalization 


PART A 


LIGHT 


SHOCK 



WINK 


PART B 


^WiNK 


N /W 


SHOCK' 

^RETHACT— Ip 


Ref 


PART C 

LIGHT WINK P^ RETRACTION 

Fig. 45. Diagrammatic representation of the 
evolution of secondary" stimulus generalization 
in the Shiple3’-Lumsdaine experiments. Part 
A shows the basic convergent mechanism, the 
light-wink portion having been set up by 
means of a previous conditioning proce^. 
Part B repre^nts the conditioning of the pro- 
pricK^eptive stimuli of two reactions evoked by 
a shock, each to the otha* reacticm through 
simultaneous occurrence clo^Iy a^ociated with 
reinforcement, i.e., ces^tion of the shock. Part 
C shows the final indirect generalization. The 
light evok^ the wink (from Part A) ; the wink 
produces a proprioceptive stimulation iPw 
from Part B); and Pw evokes a finger retrac- 
tion as conditioned in Part B, Thus the finger 
retraction has been indirectly generalized from 
the Miock to the i%ht through the medmti<m 
of the wink reaction upon which both 
and shock as stimuli converge. Throughout 
this diagram the arrows with ^lid shafts repre- 
^nt receptor-effa^tor connections which were 
in existence at the outset of the learning proc- 
here under consideration, and the arrows 
with broken shafts repre^nt connections 
up during the experiment. 
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and at the same time offer a convenient illustration of a plausible 
explanator}^ mechanism. This is shown diagrammatically in Figure 
the legend of which gives a somewhat detailed explanation. 

It is evident that when to the complexities of primary general- 
iiatioB mentioned earlier in the chapter there are added those natu- 
rally arising from secondary generalization (p. 191), the task of 
predieting generalization effects becomes almost hopeless because 
seeondaiy' generalization is so largely dependent on fortuitous ele- 
ments in the history^ of the individual organism, and these are 
usually not known to the investigator. In this connection we may 
note the great facility of normal human beings in the acquisition 
and use of ^>eech reactions and the recent experimental evidence 
that speech reactions operate in subtle ways to mediate secondary 
generalization (1). Because of these considerations, the results of 
introspective or verbal reports of the existence of generalization 
eontinua which do not conform with a reasonable approximation to 
some objective stimulus continuum in situations where interaction 
effects are presumably not marked are open to a certain amount of 
doubt. When uncertainty arises as to the status of such dimen- 
sions, the situation should be clarified by a comparison of the 
results of introspection with the generalization gradients produced 
in naive organisms presumably lacking the mediating speech habits. 

An im{H)rtant conclusion flowing from the preceding considera- 
tions is that the common-sense notion of similarity and difference 
m bmed ujmm the presence or absence of primary generalization 
gmdimtSf whereas so-called logical or abstract similarities and dif- 
ferences^ arise from secondary, learned, or mediated similarities 
mnd ii^erences, partictdarly those mediated by verbal reactions. 

TOE ^^SHMULIJS-mARNIHG’^ AND ^^STIMULXJS-EVO CATION” 
PAEADOXIB AND THEIR RESOLUTION 

*Ihe cimveiitional repre^ntation of learning as the formation of 
rimple bonds giv^ rise to certain paradoxes. The flux of the world 
to which oi^anims must adapt has infinite variety, and therefore 
specially conditioned stimuli, are never exactly repeated. 
But mpa^hrashold (adaptive) reaction potentials (p. 326 ff.) 

fca* emmple, tibe a amilar ity among weapons: this lies hardly 
lA ii acrimted. Odier examples in point are: similarity 

m d^ppe erf value, weight, height, etc., as represented by nmn- 
a» ^mmiy 10 mid 90 are hardly as different as are 10 and 

Si— m M m great as is 12 — 10. 
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usually require more than one reinforcement to be raised above the 
reaction threshold. Since the stimuli are not exactly repeated, how 
can more than one reinforcement occur? This is the stimulus- 
learning paradox. But even if a superthreshold bond should be 
established, it becomes a mystery how it could ever evoke a reac- 
tion at a time of need, because the exact stimulus w’ould probably 



Fig. Graph showing how the subthreshoid primary stimulus generalim- 
tion gradients from five distinct points on a stimulus continuurd theoretit^ly 
may summate to superthreshold values not only at the points of reinforee- 
ment but at neighboring points which have not been reinforce at all. Solid 
circles represent the results of single reinforcements; hollow circles reprint 
the results of summation. The reaction thrediold is arbitrarily taken at 5. 
Note that the major reaction tendency accumulates especially at the mid- 
point of the distribution of stimulated i>oints on the continuum, but that 
miperthreshold reaction potentialities extend beyond the range of the pointe 
conditioned. 

never again be encountered. This is the stimulus-evocation para- 
dox. The principle of primary stimulus generalization now avail- 
able enables us to resolve both of these paradoxes. 

Let it be supposed, in a particular reinforcement situation in 
which an effective habit strength of five hsbs is a minimum nec^ 
sary to evoke reaction, that each reinforcement connects the con- 
ditioned stimulus to the reaction wdth a strength of three habs; 
that five reinforcements occur, the five conditioned stimuli involved 
falling on the same stimulus continuum at uniform intervals of 


I5nS prinoples of behavior 

10 J,ii.d/s; that for a given potential stimulus each j.n.d. of addi- 
tional deviation on the stimulus continuum from the point condi- 
tioned decreases the effective habit strength by approximately one 
and that the several habit strengths thus active at a 
given jmint on the stimulxLS continuum summate to prodioce a joint 
habit strength, as would the number of reinforcements necessary 
to each if they were to be given in some standard rein-* 

forcement sequence i Postulate 4). 

The dynamics of this supposititious situation are represented 
diagrammatieally in Figure 46. The habit strengths of primary rein- 
forcement are shown by the five solid circles, the generalization 
gradients of each being indicated by the negative growth curves 
sloping downward in tw'O directions. From an inspection of these 
o\’erIapping gradients it is evident that in addition to the three 
habs arising from the reinforcement at a given pointy there are to 
be combined four lesser generalization values. A little further 
study will show' that the nearer to the middle of the distribution 
of conditioned stimuli a point of reinforcement stands^ the larger, 
upon the whole, will be the generalization values to be combined. 
Combining these five sHm values at each of the five points of rein- 
forcement and at intervals of 5 j.n.d.'s on either side, there is 
obtained the seri^ of summation values represented by the upper 
mixe drawn through the five hollow circles. An inspection of the 
latter eurt'e di^loses the following: 

!, A number of subliminal reinforcements conditioning the same 
reaction to dutinct stimuli closely spaced along a stimulus con- 
tmmm may yield an unbroken superthreshold zone of habit 
strengths eadending well beyond the range of the conditioned stimuli 
im qmM wTL 

IL TM pomt of rrummum habit strength tends to fall at the 
middle of the distribution of conditioned stimuli. 

HI. Pomts on the stimulus continuum between two points of 
but themselves not reinforced at all, have an effec- 
iim habit strength only a little less than the mean of the strengths 
of iM aijmemt reinforcement points. 

IV. PoiMs m the stimtdm continuum falling beyond the range 
of tM involved in the conditioning process also rise above 

ike ikrmhedd but in progressively smaller amounts the 

mme they are frmn the central tendency of the distribution 

of tks m^^wmd. 
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By the first item in the above summary, isolated stimuli sub- 
liminally reioiorc^ onh^ a single time ultimately become supra- 
liminal through the summation of effective habit strength (ssHm) 
generated at the point in question with that generalized from other 
points of reinforcement; thus is resolved the stimulus-learning 
paradox. By the summation of generalized effective habit strengths 
from adjacent stimulus points of reinforcement, supraliminal habit 
strengths evolve at points which ha%"e never been reinforced at ail; 
thus is resolved the stimulus-evocation paradox. 

SEMMABY 

Under favorable experimental conditions in a learning situation 
both the conditioned and the xmconditioned stimuli may be held 
relatively constant. In this way a coimection is said to be set 
up in the nervous system between the afferent discharge (s) aroused 
by the conditioned stimulus {Sj and the efferent discharge (r) 
which leads to the reaction (R). Actually, however, very much 
more than this results; the reaction is conditioned not only to a 
tone (Si but to a whole zone of tones of other pitches and inten- 
sities spreading in both directions along each dimension from the 
point conditioned. All of these stimuli are functionally equivalent 
in that they have the capacity to evoke the same reaction. This 
spreading of the results of learning to other stimuli is called pri- 
mary." stimulus generalization. The fact that many stimuli alike 
possess the potentiality of evoking the same reaction constitute 
primary Btimnlm equivalence. 

Experiments show, however, that iie strength of the habit gen- 
eralized to stimuli other than the one originally conditioned dimin- 
ishe progressively as the difference between S and 5 increases. 
When the magnitude of this difference is measured in units of the 
discrimination threshold (j.n.dj, the gradient of generalization 
closely approximates a simple negative growth or decay function. 

The introduction of the phenomenon of primary stimulus gen- 
eralization makes it quite clear that knowing the habit strength 
at the approximate points of reinforcement is now sufficient to 
enable us to predict the reaction potentiality, motivation (drive) 
remaining constant. The actual or effective habit strength mobiliz- 
able by a given evoking stimulus (S) is a joint function of the 
habit strength at the point or points reinforced and the difference 
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on tii 0 ir g^oncrslizEtion continuum between the point or points of 
reInforeeEient and the stimulus point of evocation. There thus 
emcTEes the necessity for a new s^^mbolic construct, that of effective 

habit strength 

The euncept of '‘stimulus dimension” may be contrasted with 
liuit of “afferent generalization continuum.” The first of these 
expressions refers to the physical characteristics of the stimulus 
the second, to the characteristics of the corresponding affer- 
ent disdiarge initiated by the action of the stimulus energy upon 
the receptor. Discrimination experiments indicate that there is not 
a one-t-o-one parallelism between these two variables. It is held 
that the number and nature of the various primary generalization 
gradients are caused jointly by the nature of the stimulus energy 
and the nature of the receptor response. The j.n.d. is also a joint 
function of the nature of the stimulus energy and the nature of 
the receptor response. It is probably because of this that general- 
ization is a more simple and uniform function of distance on the 
generalization continuum w’hen the latter is measured in j.n.d.^s 
than when measured in the ordinary physical units of the stimulus, 

A second form of stimulus generalization applies to stimulus 
compounds. The equivalence of two or more stimulus compounds 
in their capacity to evoke the same reaction may depend upon (1) 
the presence in each compound of certain identical (or similar) 
stimulus elements or aggregates, (2) the reaction becoming condi- 
tion?^ to the i^veral stimulus elements or aggregates in one stim- 
ulus compound, and (3) the common stimulus element in the second 
stimulus compound tending to evoke the reaction much as it did in 
the original compK^und. 

The imge of primary stimulus generalization has limitations^ 
particularly in the spread of reaction tendencies from one receptor 
to miother. Stimulus equivalence in such cases is brought 
atout by an indirect prcK^^ known as secondary generalization, 
TMs evolvij^ by a series of steps: (1) energies of distinct stimulus 
l^ome i^ditioned to the same reaction by direct reinforce- 
m«it; (2 1 a ruction may later be conditioned to one of 

the slinsiilus (3) still later, if some other stimulus also 

^mditicmod to the first reaction but not to the second should im- 
mk the organisn, tiiat stimulus will evoke the first reaction^ 
i^d propi^^ptiem of this imction will evoke the second reac- 
A chain of thk kind mediates secondary 
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or indirect stimulus generaiizatiorij a second form of stimulus equiv- 
alence. 

The summation of overlapping priman.’ stimulus generaliza- 
tions, even if definitely subliminal at their point of reinforcement 
and even if each stimulus point is reinforced only once, may be 
shown to raise the effective habit strength above the reaction 
threshold not only at the points reinforced but at neighboring 
stimuli which have never been reinforced at all. In this way are 
explained both the paradox of the occurrence of superthreshold 
learning where the conditioned stimulus is never exactly repeated, 
and the paradox of reaction evocation where the evoking stimulus 
has never been associated with the reaction evoked. 

In view of the preceding considerations we may formulate 
Postulate 5: 

POSTULATE 6 

The effecdve habit strength sBr is jointly (1) a negative growth 
function of the strength of the hatnt at the point of reinforceinent (S) 
and (2i of the magnitude of the difference (d) on the continuum of that 
stimulus between the afferent impulses of s and s in units of discrimi- 
nation thresholds (j.md.’s) ; where d represents a qualitative difference, 
the slope of the gradient of the negative growth function is steeper than 
where it represents a quantitative difference. 

From Postulates 4 and 5 there follows an important corollary; 
because of the frequency of its use it is here given special promi- 
nence: 

MAJOR COROLLARY I 

All effective habit tendencies to a given reaction, whether positive or 
negative, which are active at a given time summate according to the 
podtive growth principle exactly as would ffie reinforcements whidh 
would be required to produce each. 

NOTES 

Mathematical Statement of Postulate 5 

Ihis pcBtulate is ex:pr^ed concisely by the equation: 

sBb = (29) 

where, 

sHe h as given in equation 16, 

d m difference between S and S in j.n.d.\ 

f is an empirical <X)nstant of the order of .01 in the case where d is a qualita- 
tive diffemnce tmt of the order of .006 where d is a quantitative difference. 
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Mathematical Statement of Major Corollary I 


SS^M = Si — 


4- (- D- 


-1 ■■ j”- 

Mn-V 


where Si is the simple sum of the items to be combined, S2 is the sum of the prod- 
ucts of al combinations taken two at a time, S3 is the sum of aU products taken 
three at a time, etc., and M is the physiological limit of the learning process, in 

thk case a^umed to be 100. 

The Hovland Stimulus Generalization Gradients 
HotIbM's numeric^ values from which Figure 42 was plotted are as follows: 


No. didard from 

poini of reinJoTcemerd (d) 
0 

25 

50 

75 


Amplitude of galvanic sJdn 
reaction in millimeters (A) 
18.3 
14.91 
13.62 
12.89 


The r^ative grwrth function fitting these data rather well is: 

A = 18.3 - 6 (1 - lO- oias^J), 

where A m the amplitude (in millimeters) of the galvanic skin reaction to stimu- 
Istkm, 12.3 represents the asymptote or l imit of fall of the value of A, 6 is the 
maximuin amount of change in A due to generalization, and .0135 is a constant 
depending in jmrt on the st^ness of slope of the generalization function and in 
on the unite emjdoyed. This ^nation is represented by the smooth curves 
drawn data points in Figure 42. 

The mmotei curve drawn throi^ the data points of Figure 43 corresponds 
to die equa&m: 

A = 14.3 ~ 2.24(1 - KF-OQSid). 


Tise MeliKKi of Etenving from Empirical Values the Constants for the 
Growth Function Fitted to Hovland’s Data 

It was ocK^lu^d from an inspection of the data in graphic form that the 
ftiKttee was a demy or n^ative growth variety with an as3nnptote 

©rwirar tiian zam, Le., that the equation would probably of the 

fam: 

A « a -f (5 — a)10”*^, 

la I k tiie value of A when d — zero. This is given directly by 

^ia. M 18 . 3 , Substituting, the equation becomes : 


A * o + 


18,3 -- a 


1ft ©qmtiML, A Mid i lyre pvea by the table of empirical values, which leaves 
a®d €, These Taiu^ are found by means of simultaneous 
1^1 d which may be up from Hovland^s empirical results. 

♦ Hk mM Arthur 8. Day. 
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Taking i = 50 for one equation, and d = 75 for the other, and sutetltuting the 
corn^ponding A-valu^ from Hovland’s data, we have: 


13.62 = a 


18.3 - a 


12.89 = a 


18.3 - a 

lOrsi 


Solving this |mir of simultaneous equations we find that a = 11.1 and h = .W8. 

Setting up the two remaimng significant simultaneous eombinations afforded 
by Hovknd's dak. we obtain additional -values for both a and k. The whole 
seri^ of valu^ is as follows : 


d 

a 

h 

2Bmdo0 

12.9 

.017 

25 75 

12.5 

.015 

50 “ 75 

11.1 

.m 


Taking the approximate central tendency of the three values for each <M>iistant 
we have, a = 12.3 and k = .0135. Substituting, w’e have: 


A 


= 12.3 -f 


18.3 - 12.3 

IQ .0135 


= 12.3+6 X 10- 01^^, 


which is the equation sought. 


The Derivation of Figure 46 

At the outset it is assumed that single subliminal reinforcements were made 
at five points on an afferent generalization continuum : 120, 130, 140, 150, and 
160 J.n.d.*s distant from a rather remote common point marked zero in the figure. 

Next, the generalization value from each point of reinforcement was calcu- 
lated by the equation: 

sffn = 3Xl0-«i*5«*. (31) 

For reasons given in the text (p. 186), this ^uation has b^n adapted from 
derived from Hovland's data as a ^juiwier and more ge]:ieral equa&m im the 
generalization gradient. In this way d>tained the re^ts giraa in 
fdlowing table: 


c^mj.n,d, (d) 

Gmerdiz^ habU 

strength (sHb) in halm 

0 

3.0 

5 

2.57 

10 

2.20 

15 

IM 

m 

1.61 

m 

1.18 

40 

,87 

50 

,63 

60 

.46 

70 

.34 

80 


90 

.18 

100 

.13 

110 

.10 
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From this table were obtained the generalization values sloping downward 
and away in each direction from the crests which appear in the lower part of the 
figure at the five points of reinforcement. 

The next step in the process was to combine the overlapping generalization 
tendencies which appear at every point on the stimulus continuum as five items. 
For example, at 1^ the first item is 3.0 because here d = zero. Next, there is 
the generalization value of the gradient originating at 130, which is 10 j.n.d.’8 
dstant. By the above table this would yield 2.20 habs. Then there is the 
generalization value originating at 140. Since 140 is 20 j.n.(L^s from 120, d = 20, 
which, by the £dx>ve table, eorr^ponds to a generalization value of 1.61 habs. 
In a timilar way the other two sHb values with d's of 30 and 40 are shown by 
table to be 1.18 and .87, respectively. 

We thiB have the problem of combining 3.0, 2.2, 1.61, 1.18, and .87 habs. 
How fi-hft.ll this be done? The hypothesis which fits in best with various related 
eminrical observations is that the generalized effects of learning sum mate in 
way as do the effects of repetitions in the learning process. The prin- 
ciple according to which repetitions of reinforcement combine to produce halnt 
ti^dencie have b^n explained in considerable detail in Chapter VHI and are 
stated concisely in Postulate 4 and in equations 1, 26, and 30. It will be sufficient 
here cmly to say that each repetition was supposed to increase the amount of 
habit strength by a constant factor (e.g., 1/10) of the learning potentiality not 
yet realized in actuality. This means that the amount of habit strength con- 
tributed by one repetition of reinforcement late in the learning process is very 
much 1^ than that contributed by one repetition at the beginning. In a similar 
manner, a Hock of five repetitions given late in the learning process will contribute 
tes to tl^ habat steaagtii than an exactly similar block of five repetitions early 
in It was a^imed that the five generalized tendencies summate 

m would ti)^ numb^ of r^>etitions required to produce each. Thus one 
might mib^tute the value of in equation 26: 

sHr = m(l - lon^ 

^:ive lea* AT in the <ase of each of the five values listed above, add togeth^ 
set of N^s th^by chtained, and, finally, substitute the sum of the five 
in the equation, this time solving for sHb; the result of the last operation 
wi^iH tiie mimmation required. 

Tlie above pttKsedur^ while cont^ptuaJly simple, is very Humsy mathe- 
the matiieinatical implications of the assumption have 
fee® €8it fear the general ^se where n values must be summatedL This 

^ form erf ^ven in the second terminal note above (30). 

The ^ that filiation will be made clear by the following example: 

* aOO 4- ^ + 1.61 4- 1.18 4- .86 = 8.85. 

« tm X 4- 3.00 X 1.61 4- 3.00 X 1.18 4- 3.00 X .86 4- 2,20 X 1.61 
4- laj X i.18 4- 2.^ X .86 4- 1.61 X 1.18 4" 1.61 X .86 4- 1.18 X .86 
» «.» 4- 4^ 4- 3.54 4- 2.58 4-3.54 4-2.60 4- 1.89 4- 1.90 4- 1.38 4- 1.01 

= msi 

^ « S.O0 X 2J0 X 1.61 4- 3.00 X 2.20 X 1.18 4- 3.00 X 2.20 X .86 4- 3.00 
Xl.il X 1.18 4- 3.00 X 1.61 X .86 4- 3.00 X 1.18 X .86 4- 2.20 X 1.61 
XL18 4-2^ X IM X 4-2.20 X 1.18 X .86 4- 1.61 X 1.18 X .86 
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Xi = 3.00 X 2.20 X 1.61 X 1.18 + 3.00 X 2.20 X 1.61 X .86 + 3.00 X 2.20 
X 1.18 X .86 -f 3.00 X 1.61 X 1.18 X .86 + 2.20 X 1.61 X 1.18 X .86 
= 12.54 T- 9.14 -r 6.70 + 4.90 + 3.59 
= 36.87 

Si = 3.00 X 2.20 X 1.61 X 1.18 X .86 
= 10.7& 




36.87 


10.78 


100 ‘ 10,000 1 , 000,000 * 100 , 000,000 

= 8.85 - -i- .0048 - .000037 + .00000011 

= 8.55 


wMcli is the value of ssI^r at on the upper or summation grapli in Figure 46. 
All of the other points in the summation curve were computed in an analogous 
manner. 

In general this method of summation yields a value appreciably less than 
would be obtained by the simple addition of the items summated, the shrinkage 
being greater the nearer the individual items approach the magnitude of the 
physiological limit (3f), in this case, 100. Because of the relatively small size 
of the items, the shrinkage here is slight. 
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CHAPTER XIII 


Some Functional Dynamics of Compound Conditioned 

Stimuli 


Most accounts of conditioning experiments tend enormously to 
minimize the actual complexity of the factors involved. Indeed, 
this is almost a necessity; if the reports of such experiments should 
contain a really complete description of the process the reader 
would be so swamped in detail that he might easily fail to under- 
stand the main point of the experiment. The same expository diffi- 
culty is encountered by behavior theorists in perhaps an even more 
aggravated form and has led to the same type of misrepresentation. 
An example of this is our own use of the symbol sHs- There is 
small doubt that such expository over-simplifications in the ac- 
counts of learning and other behavior situations have genuinely 
misled many persons beginning the study of behavior and, pos- 
sibly, in some cases even the investigators themselves; they cer- 
tainly have produced much misapprehension among the philosophi- 
cal critics of behavior theory, who as a rule have little or no first- 
hand knowledge of the phenomena concerned and so are especially 
prone to such mismderstandings. In Chapter XII something was 
done to remedy this w'holly natural yet regrettable situation by 
considerably expanding the concept of habit with respect to the 
stimulus; a parallel expansion on the response side will be pre- 
seited in Chapter XVII. In the present chapter we shall seek to 
clarify the conc^t of the stimulus (S) still further by deriving a 
number of elementary corollaries from the conditions under which 
learEung oecuts when considered in conjunction with certain 
peimary principles; Uie latter are fca: the most part already familiar 
to toe r^er. 

THE OOMPI^EXTIT OF THE “STIMITLTJS” OF A TYFICAD 

oosnomoNiNG smTAnoN 

In mde’ to clarify to some degree the actual complexity of the 
"sUsHilas” inwcdved in typical habit formatioh, let us consider in 
a httie detad this aspect of what is usually regarded as one of the 
suae leanung ratoations, that of the Pavlovian conditioned 

204 
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reflex (6). Previous to the beginning of the conditioning process 
the dog has had no food for 24 hours. It stands upon a table in 
the laboraior}', being kept in place by loose bands attached to a 
wooden framework. Some weeks previous to the experiment one 
of its salivaiy’ ducts has been surgically diverted so that saliva is 
discharged through a fistula in the side of the animaFs face. When 
the experiment actually begins, a capsule is cemented over the 
fistula in such a way that it collects the saliva seeping through the 
opening; the pressure in the capsule resulting from the entrance of 
the saliva is transmitted by a rubber tube to a sensitive register- 
ing device. 

An electric buzzer is sounded near the dog for five seconds; 
two seconds after the termination of the buzzer action a small 
quantity of meat powder is bloTO into the animaFs mouth by means 
of a rubber tube held in place by a kind of muzzle. This powder 
is eaten, with a profuse accompanying flow of saliva. After a few 
repetitions of this sequence, the dog begins secreting saliva during 
the se%*en-second inter\*al between the beginning of the sotmd and 
the delivery of the meat powder, shotting that the conditioned 
reflex has been set up. 

The above summary description of the conditioned reflex experi- 
ment is a fairly typical example of the accounts usually^ the 

only conditioned stimulus element specifically mentioned as active 
in the situation is the buzzer.* As a matter of fact, the buzzer 
'vibration makes up only a small part of the total number of stim- 
ulus components involved. Moreover, the wave pattern of the buz- 
zer itself, as revealed by the cathode ray oscillograph, is an exceed- 
ingly complex phenomenon and doubtless stimulates simultaneously 
a very large number of the ultimate auditory receptors in the 
cochlea. 

Among the manyr additional components of the conditioned 
stimulus (S) not ordinarily mentioned are: the fact that the ani- 
maFs two ears receive the buzzer vibrations with diifferent intensity 
or in different phase, depending on (1) the direction of the bell 
from the dog’s head and (2) the orientation of the head at the 
moment; the pressure of the dog’s feet against the table top upon 
which it stands; the pressure of each of the three or four restrain- 
ing bands upon the skin receptors of the dog^s neck, thighs, etc.; 
the biting of a number of insects which may be hidden in the 
dog’s hair; the contact of the capsule over the fistula; the pressure 
of the muzzle against the dog’s head; the pressure of the rubber 
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tube in the dog's month; the odor of the rubber from which the 
tube is made, together with a large number of miscellaneous odors 
to which the human olfactory receptors may or may not respond; 
the multitude of visual stimuli of light, shade, spatial combinations, 
ete.j arising from the laboratory lamps and reflected from millions 
of points within the dog's visual field; the proprioceptive stimu- 
lation arising from the external and internal muscles of the dog's 
eyes as they fixate one object after another about the laboratory; 
the infinite number and variety of proprioceptive impulses origi- 
nating in the several parts of the other muscles of the animaPs 
boiy as they are employed in the maintenance of the postures 
taken from moment to moment; the too-little understood stimula- 
tions associated with the bodily state resulting from food, water, 
and sexual privation, rectum and bladder pressure, etc.; and, finally, 
the perseverative traces of all the multitude of stimuli recently 
acting, whether the stimulus energy is continuing to act at the 
moment or not. The conditioned stimulus in the experiment under 
consideration includes all of the immensely complicated stimulus 
elements here enumerated and many more besides; nevertheless 
this list, incomplete as it is, should aid the reader somewhat in 
overcoming the misleading suggestion of singularity and simplicity 
otherwi^ likely to be conveyed by the S of the symbol, sHr. 

THE DiSTOIBUTION' OF HABIT STRENGTH ACQUIRED BY THE 
SE\’ERAIi ODHPONENTS OF A STEMULUS COMPOUND 

The law of primary reinforcement as formulated in Chapter VI 
pr^ented, in the interest of introductory expository clarity, the 
ultra-rimple view of the conditioned stimulus which we have just 
beaa at Hime pains to rectify. We must now consider the opera- 
tion of this principle imder the present expanded conception, par- 
m it appli^ to the several types of components which 
may be found in a stimulus compoimd. 

AceoftiiBg to the of reinforcement' laid down earlier (p. 
8CI), one of the receptor discharges and receptor-discharge 

^o^veiations active at the time that the to-be-conditioned reac- 
tioi must acquire an increment of habit strength ( AsHb). 

of iMs principle, coupled with the recognition of 
tfee iml^bcity varied of these afferent elements, at once 
suaimi® critic qu^ticms, one of which is: Are these incre- 
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merits of habit strength all of the same magnitude? The answer 
to this question is quite definitely that the increments of habit 
strength acquired the several afferent discharges arising from 
the various stimulus aggregates represented by such w’-ords as 
^•buzzer sounds” ^‘odor of food,” “sight of food cup,” “pressure of 
restraining bands,” etc., differ widely. In this respect the situation 
is believed to be substantially as represented in Figure 47, where 
the thickness of the broken lines connecting the several stimulus 
aggregates {SVj represents the vaiydng mag- 
nitudes of the increments of habit strength 
acquired by them at a given reinforcement. 

The S s showm in the diagram are, of course, 
far too few’ to more than suggest the number 
of actual stimulus elements, or even the num- 
ber of the aggregates of stimulus elements,^ 
in the typical conditioning situation. 

Recognition of the variability in the incre- 
ments of habit strength acquired by the sev- 
eral stimulus components of a conditioning 
situation at once raises the question of the 
principles according to w’hich the differential 
magnitudes of habit-strength loadings arise. 

This question does not permit a very definite 
answ’er, though a certain amount of experi- 
mental effort has been directed to this end. 

One bit of e\ddenee comes from the Kappauf- 
Schlosburg experiment described above (p. 

166 ), which showed that stimuli which have 
continued to act on a receptor without change 
for some time have a greatly diminished ca- 
pacity for acquiring habit loadings. Partly 
for this reason it is probable that static, i.e., 
unchanging, elements or aggregates in a conditioned stimulus 
situation are considerably less potent in the acquisition of habit- 
strength loadings than are the more dynamic, i.e., chan^g, ele- 
ments or a^regates. This probably is why investigators so fre- 
quently neglect to take into consideration the static or constant 


h 

Fig. 47. This figure 
represents the sheaf 
of habit tendencies 
(AsHm) presumably 
set up by each rein- 
forcement. The thick- 
ness of the dashes 
leading to the several 
arrow points is in- 
tended roughly to repn 
resent the differences 
in the magnitude of 
the habit increments 
or habit loadings con- 
necting the several 
stimulus elemente in 
the conditioning situ- 
ation, with the reac- 
tion process. 


^ A stimuLm element is defined as the action of a stimulus energy upon a 
angle receptor organ, such as a single rod of the retina or a sin^e touch 
or^n of the ridn. A stimvlm aggregate is a group of stimuli which ordi- 
narily begin and end concurrently and, in general, combine to perform the 
same adaptive functions. 
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elements of conditioned stimulus situations, sometimes with unfor- 
tunate results. 

A second factor which ma}’' be of some importance in deter- 
mining the habit-strength loading acquired by a stimulus aggre- 
gate in a learning situation is the intensity of the stimulus energy. 
Pavlov reports (6, p. 142), on the basis of admittedly inadequate 
empirical data, that when two stimulus energies of different inten- 
sities operate on the same receptor simultaneously, e.g., two differ- 
ent tones, the stronger stimulus receives a greater increment of 
habit strength. 

A third factor is the receptor or ^^analyzer,” which receives the 
stimulus energy. Pavlov reports (6, p. 143), again without ade- 
quate supporting e\udence, that, other things equal, tactual stimuli 
seem to acquire stronger habit loadings than do thermal stimuli 
"and that auditory stimuli are stronger in this respect than visu^ 
stimuli. Recently Zener has reported orally a well- controlled 
study which fully establishes this proposition for relatively low 
stimulus intensities. 

A fourth possible factor in determining the relative habit load- 
ing of the several components of a stimulus compound concerns 
whether or not a stimulus appears in a large number of conditioning 
Htuations which require a wide variety of reactions to bring about 
need reduction, and also in many situations requiring no reaction 
whatever. For example, daylight is present as a visual stimulus 
component in thousands of different reaction situations. It seems 
likely that the great number of reactions to which this stimulus 
comfxjnent is conditioned early in life, coupled with incidental 
extinction eff^ts which necessarily result from such a state of 
affairs, would s<x>n largely blur out the capacity of such stimuli 
to eonditicm^ to any reaction in particular. 

A fifth f^tor which follows as a kind of corollary to the factor 
of intea^ty m that a stimulus component which has previously 
oomutimied to a reaction involving strong autonomic or 
ssp^te, e.g., a fear reaction, will presumably acquire 
m tMi indirect way a stronger habit loading than would a eom- 
^nent not so condition^. This would be expected on the assump- 
tion that the proprioceptive and other receptor discharges entailed 
by the Incurrence of the conditioned reaction in question would 
a relatively intense stimulus which, as such, would ac- 
^ife a ©orr^^®idii^y fa^vy habit loading from the reinforce- 
A very w^k stimulus throu^ lack of vigorous 
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competition or through especially powerful reinforcement in ’an 
earlier conditioning situation might thus acquire control of a reac- 
tion yielding a powerful stimulus and in this indirect way attain 
the appearance, in subsequent conditioning situations, of having 
itself a strong capacity for acquiring habit loadings. It seems 
likely that this mechanism explains to a considerable extent the 
role in the learning process of what is reported introspectively as 
“attention/' 

Unfortunately as yet very little experimental effort has been 
directed to the solution of these problems. For this reason most 
of the suggesticms listed above must be regarded as hardly more 
than conjectures suitable as points of departure for future inves- 
tigations. 

THE JOINT BH^CTION-EVOCATrON POWER OP COMPOUND 
STIMULUS AGGREGATES 

Among the problems precipitated by the highly compound char- 
acter of the conditioned stimulus there is the question of the 
relative action-evoking power (and, presumably, of effective habit 
strength) of a stimulus element or aggregate (S) when acting in a 
stimulus compound as contrasted with that when it is acting alone. 
There are two cases; one is that in which the stimulus components 
are separately conditioned to the same reaction and are later tested 
for power of reaction evocation by being presented to the subject 
simultaneously as a stimulus compound. The other is that in 
which the stimulus components are presented simultaneously as a 
compound during the conditioning process and are later teted for 
power of reaction evocation by being presented to the subject sepa- 
rately. Here, as in so many other aspects of beha\dor science, 
really adequate empirical investigations are largely lacking. How- 
ever, some experimental results are available which are useful for 
illustrating the nature of the problems and for suggesting plausible 
hypotheses, even if not adequate to serve as a basis for final deci- 
rions. 

We proceed now to the consideration of the empirical evidence 
concerning the case where the stimulus compon^ts are conditioned 
^pamtely to the same reaction. In one illustrative experiment 
eight human subjects were first condition^ to a weak light, the 
unconditioned stimulus being a brief electric shock and the resx)on^ 
m^orded, the galvanic skin reaction (:^). Next, a weak vibratory 
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stimulus applied to the forearm was paired with the shock and 
thus conditioned to the galvanic skin reaction. When presented 
alone the light (5z) evoked a mean reaction (Ri) of 3.5 millimeters, 
and the vibrator (5r) evoked a mean reaction (R^) of 3.6 milli-* 
meters. When both stimuli were presented simultaneously (Si+r), 
they jointly evoked a mean reaction (Ri + v) of 44 millimeters. 

Since the two stimulus components were of almost equal reac- 
tion-evocation strength when presented separately, it is reasonable 
to suppose that they contributed about equally to the joint evoca- 
tion when they were presented simultaneously, i.e., that each con- 
tributed approximately half of the 4.4 millimeters of the joint 
evocation, or 2.2 millimeters. These results, which are typical of 
a number of fairly comparable sets now available, indicate a defi- 
nite shrinkage in the reaction-evocation power (and so, presumably, 
of effective habit strength) at the command of stimulus compo- 
nents in combination, as compared with that when acting sepa- 
rately. 

A simple quantitative index of this shrinkage is obtained by 
dividing the amplitude of the reaction evoked by the joint stimu- 
lation by the sum of the amplitudes evoked by the separate stimu- 
lations: 

^ 4.4 _ 44 _ . 

Bi + R^ 3.5 + 3.6 7.1 * 

This is interpreted to mean that a stimulus when presented jointly 
with another evokes only about 62 per cent as great a reaction as 
it dc^ when printed separately, thus showing a shrinkage of 
3^ per cent, or about a third. A comparable experiment (B), also 
employing eight human subjects, yielded the following values: 

_ 3.91 _ 3.91 _ , 

Ri + Rn 2.2-f3.7“" 5.9 

TWs also shows a shrinkage of almost exactly one-third. 

The ^^ond ca^ of the reaction-evocation power of compound 
is the reverse of that just considered. This is illustrated 
by a simple permutation of the series of experiments outlined 
ai»ve, the pr^nt experiment likewise employing eight human sub- 
Jecte iih The light and the vibrator, presented simultaneously, 
wtie pair^ witii the shock. The mean amplitudes of the condi- 
^vwdc sMn reaction evocable by the compound and by 
s^^ate were then determine. It was found that 

li^ (Si) evok^ a mean reaction mnplitude of 2.5 milli- 
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meters; the \ibrator alone (Sr) evoked a mean reaction amplitude 
of 2.9 millimeters; and the two components presented as a com- 
pound (Sj+r) evoked a mean reaction amplitude of 3.3 millimeters. 
Despite the difference in the arrangement of this experiment, the 
shrinkage of the reaction-evocation power of the stimulus com- 
ponents when in a compound as compared with that when pre- 
sented separately may be calculated by the same formula: 

^ — fti 4. 

Ri + R, 2.0 + 2.9 5.4 ■“ 

The calculation yields a value within the range of experimental 
error of that obtained from the experiments falling under case one. 

Another exactly comparable experiment from the same study, 
also employing eight human subjects, yielded the followung result: 

Ri+t _ 2.8 ^ 2.8 _ I 

Ri + R,~ 1.4 + 2.8 ^ 4.2 “ 

To the twm cases of compound stimulation already considered 
there may now be added a third case which is of special interest 
because it is neutral so far as possible interaction effects arising 
from the circumstances surrounding the conditioning are concerned. 
This is illustrated by an experiment employing eight human sub- 
jects (B) in vrhich the S >R connection -was the result of W’hat is 

called sensitization^ i.e., it was set up by merely administering the 
shock without the latter being paired with either the light or the 
\dbrator. This experiment yielded the following result: 


Ri-i-g 3.2 3.2 

Ri+^ 2.2 + 279 ~ Kl 


which is in close agreement with the results of the other four 
experiments. 

Thus, from the point of view of the quantitative index, 

R I R "^ 

all three cases, so far as the available evidence goes, agree in 
showing that where two stimulus a^regat^ are concerned a shrink- 
age of about one-third in reaction-evocation power occurs; i.e., 
from this particular point of view no marked difference between 
the three cases appears. This may very well be the sibiation 
which will finally be revealed by further experiment. 

There are, however, certain indications reported by Pavlov (6, 
p. 141) to the effect that where two dynamic stimulus compon^ts 
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are jointly eondidoned to a reaction, the stronger of the two may 
completely dominate, obscure, or '^overshadow’’ the weaker eom- 
rK»rit-nt in that when the latter is presented separately it will evoke 
no reaeiion whatever, though in certain indirect ways it may be 
shown to possess some kind of functional connection with the reac- 
tion. In this context it will be recalled that in the fourth example 
given above, which involved a reaction conditioned to a compound 
stimulus, the mean amplitude of reaction evoked by the compound 
12.8 millimeters I w’as exactly" the same as that evoked by the 
stronger of the two stimulus components, the cutaneous vibrator; 
the addition of the light to the combination seems to have added 
nothing to the mean amplitude of reaction. This outcome might 
quite iK^ibiy have been due to experimental "error,” i.e., to the 
fact that the sample of data collected was too small to yield a 
sufficiently precise indication of the relative influence of the several 
factors involvai. 

After weighing the various bits of experimental evidence avail- 
able, the most plausible empirical generalkation concerning the 
d>Tiaiiiies of comix)unds of conditioned stimuli in the reaction evo- 
cation is that aggregates conditioned to the same reaction 

irrespective of whether the stimvlus aggregates were con- 
ditkymd to the reaction as separate entities or as a stimvlus com- 
pownd, (1) a smallm' power of reaction evocation when presented 
family than when presented separately, hut (£) a larger joint power 
of reaction evocation than does any single component whm the 
latter u presented sejMirately. 


THE PUNCTPLl^ OP HABIT SUMMATIOK' ANT) OF MONOTOOTC 
HABIT-REACnON RELATIONSHIP AS APPLEEID 
TO STIMULUS CX)MPOUNDS 

^ ith ^me of the coarse empirical reaction-evocation 

djmmmcB of stimulus compounds before us, the question arises as 
to whether the above empirical generalizations represent primary 
priucipte or ^c^mdary principles derivable from a combination of 
primaiy principle already in the ^stem. The answer to this type 
of qii^tion dep^ds on whether or not the principle can be derived 
imm Cfther and supposedly more elonentary principles; if so, 
it a a principle; if not, it may be a primary principle. 

MAmMU that by this t^, tibe dynamics of reaction 

se^irf-oider, or dOTv^, principles. major 
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portion of these phenomena appear to be mainly dependent upon, 
i.e., derivable from, two principles already more or less familiar. 

The first of the primary principles in question concerns the 
manner in which two or more homogeneous habits, i.e., those in- 
volving the same reaction, summate to produce a joint habit 
strength. We encountered a situation of this kind above ip. 195) 
when we considered the summation of habit strengths arising from 
the overlapping of stimulus generalization gradients both of whieli 
w'ere conditioned to the same reaction. In that case we had the 
physiological summation of habit tendencies generated by the con- 
ditioning of the same reaction to two distinct stimuli but, tlirougli 
the process of generalization, brought to bear on the evocation of 
the reaction by the impact of a single stimulus aggregate. The 
present situation presumably has the same dynamic state of alfaim 
brought about by the simultaneous action of tw’O stimulus aggre- 
gates each of which has a tendency independently to evoke the 
same reaction. As in the case of overlapping stimulus generaliza- 
tions (Chapter XII, p. 199), the quantitative principle of summa- 
tion is given by Major Corollary I. As applied specifically to the 
present situation, this may be restated as follows: If two or more 
stimulus aggregates^ each independently conditioned to the same 
reaction^ impinge simultaneously on the receptors of the organkm 
in question, the effective habit strengths borne by the several slim’- 
tdus aggregates summate to produce a joint habit strength as would 
the separate effects of the number of reinforcements necessary to 
produce each, if such reinforcements were to be given consecutively 
in some standard reinforcement sequence. 

Thus, suppose that one stimulus aggregate, such as a weak 
light, has a habit strength to the evocation of a given reaction of 
34.39 habs, and a second stimulus aggregate, such as a cutaneous 
%dbrator, has by independent reinforcement a habit strength pf 
46.86 habs to the evocation of the same reaction. Now, by Table 1 
(Chapter VIII, p. 115), 34,39 habs would (under certain assumed 
conditions) be produced by four reinforcements, and 46.86 habs 
would be produced by six reinforcements. On the above principle 
it follows that the physiological summation of the two habit 
strengths would yield a joint habit strength equal to that which 
would be prcKiuced by 4 + 6 = 10 rehaforcements, which, by Table 
1, equals 6513 habs. 

The second principle presumably operating here is that of the 
numotordc habit-reaction relationship (p. 326 flF.). This is that 
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the strength of a reaction tendency is an increasing linear function 
of the effective habit strength mediating it; le,, if 


then 


SiHji > sJ^By 


Rsi ^ Rsi> 


SOME CXIHOLIABIES OF THE PRINCIPLES OF HABIT-STRENGTH 
CX)MBiNATION AND OF THE MONOTONIC HABIT-REAC- 
TION RELATIONSHIP APPLIED TO CONDITIONED 
STIMULUS COMPOUNDS 

It follows from the above principles that the sum of the reaction 
amplitude mediated by 34.39 habs and of that mediated by 46.86 
habs will be an increasing function of 34.39 + 46.86, or 81.25, 
whereas the magnitude of the reaction mediated by the joint action 
of the two stimuli will be a corresponding function of 65.13. How- 
ever, 

65.13 < 81.25. 

We accordingly formulate Corollary I as follows: 

1. The amplitude^ of the reaction evoked by two stimulus ag- 
gregates acting lointly will be less than will be the sum of the 
reaction magnitudes evoked by the respective stimulus aggregates 
acting separately. Stated in a formal manner, Corollary I becomes 
the following inequality: 

9 < + Rr- 

Returning, now, to our empirical data we find ample illustra- 
tion in all five experiments. For example, in the first experiment 

we find, 

= 4.4 

aim 

+ 12, = 3.5 + 3.6 = 7.1 

Le,. 

4.4 < 7.1, 

whieli M!y the above inequality. 

A s^ondary corollary which flows from the same assumptions 
mmcems the re^tion ma^itude mediated by the summation of 

amplitude appears to hold only for certain 
» the galvanic i^ln i^ction, here used as illustrative mate- 
olivary ami the lid reaction. For certain other typ^ of 

m are normally ^prcMiticed by the striated muades. pro-bability 
^ Cp) at stimulation, lateiKy or reristance to experimental 

wfil to sd^tuted for amplitude. 
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two habits as contrasted with that mediated by either habit alone. 
We have seen that the joint strength of a habit of 34.39 habs and 
one of 46.86 habs amounts, theoretically, to 65.13 habs, which is 
greater than either habit strength taken alone, i.e., 

34.39 < 46.86 < 65.13. 

Therefore, 

Rsi < 

Generalizing from the above we arrive at our second corollary': 

II. The magnitude of the reaction tendency evoked by any one 
of a number of stimulus aggregates conditioned to the same reac- 
tion will be less than that evoked by two or more of them acting 
simultaneously. Empirical illustration of this corollary is seen in 
four of the five experiments cited above. Thus 4.4 is greater than 
either 3.5 or 3.6, and 3.9 is greater than either 2.2 or 3.7. The 
slightly discordant results of the fourth experiment are probably 
due to limitations in the size of the sample. 

Thirdly, suppose that three equally potent stimulus aggregates 
possess a joint strength of 71.76 habs. Now, 71.76 habs corresponds 
(Table 1) to twelve standard reinforcements. Thus the three stim- 
ulus components must have independent strengths equivalent to 
one-third of 12, or four standard reinforcements, each of which, by 
Table 1, corresponds to 34.39 habs. According to the principle of 
habit strength summation, two of these, if taken together, would 
summate to a habit strength equivalent to eight standard rein- 
forcements, which (Table 1) would yield 56,95 habs. But, 

34.39 < 56.95 < 71.76 
i.e., 

SiHr < SiHr "b sJRr < + S>Rb + Sr“S* 

It follows from this and the principle of the monotonic habit-reac- 
tion relationship that, 

R& < Rs, + St < Rsi + St+ St- 

Generalizing, we formulate our third corollary: ' ■ 

III. If a number of stm^llus aggregates, all equaJRy conditioned 
to the same reaction, impinge upon the organism simtdtaneously, 
the larger the number of such stimulus aggregates active on a given 
occasion, the greater uill be the amplitude of the evoked reaction. 
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THE PRIXCIPLES OF AFFERENT INTERACTION AND PRIMARY 
STIMULUS GENERALIZATION 

T!ie tliirci principle upon which the dynamics of reaction evoca- 
tion by stimulus compounds depends is that of afferent neural 
muracilon * Postulate 2, p. 47). In the present context that 
principle may be stated as follows p. 77) : Concurrent afferent 
impulses arising from the impact of distinct stimulus energies 
iSi on receptors are appreciably modified by each other before 
they reach that portion of the central nervous system where they 
initmte the efferent impulses (r) which ultimately evoke reac- 
iicm (E), 

The relevance of the afferent interaction hypothesis to the 
dynamics of stimulus compounds arises from the fact that two 
stimulus aggregates may, as we have already seen, act either inde- 
pendently of each other or in combination. But, by the principle 
of afferent interaction, the afferent impulses which originate in a 
pven stimulus aggregate are somewhat different on reaching the 
more central ponions of the nervous system when they occur con- 
currently with the afferent impulse arising from the action from 
some other stimulus energy than when the second afferent impulse 
is not ixeurring. Thus, two stimulus energies St and when act- 
ing separately initiate afferent impulses St and s^, but if acting at 
the same time th^e afferent impulses interact, changing each other 
to some extent ^ that by the time they reach the more central 
portions of the ner\'ous system s^ has changed to Si, and has 
changed to Sf. 

Now, suppose that and Si have each been conditioned sepa- 
ratei} to reaction, E. This process would produce the relationship, 

>r 

and 

>T 

Hcfweier, when Sj and Sg act simultaneously as a stimulus com- 

puBd, tie afferent impulse which tends to r >R in the one case 

ii not -% but and in the other case it is not Sg but sg. 

At this piint the principle of afferent interaction in stimulus 
m mpphmmtM by the principle of the primary stinv- 
§€mr^im^ grmdknt (Postulate 4, p. 178). In the present 
mm may be wmUiM m follows: The habit strength at the 
of m r^mmt im^be is a decreasing growth function 



COAiPOUND CONDITIOXED STIMULI 2 1 7 

of the dinerence \di betv:ecn the evoking aderenf imptdse (S| and 
the affererd impulse (.§» originally conditimed to the reaction, Tims 
tiie effective habit strength commanded b}’ I| will be less than that 
commanded by Si as originally conditioned^ i.e., 

hHs ^ tiffs* 

From the two last mentioned principles there may be deduced 
a number of corollarieSj some of which limit the generality of the 
eorollaries just derived, and vice versa. Actually, al! the principies 
here under discussion are operating in eveiy^ hypothetical situation 
considered in the present chapter; in the interest of expository 
clarity their action is here taken up separately. They will be con- 
sider^ jointly in Chapter XIX (p. 349 ff.). 

SO^IE COROLLARIES OF THE PRINaPLE OF AFFERENT INTERACTION 
AND OF PRIMARY STIMULUS GENERALIZATION APPLIED 
TO STIMULUS COMPOUNDS 

From the inequality last considered and the monotonic habib- 
reaction relationship it follows that, 

Generalizing, we arrive immediately at our fourth corollary: 

IT. If a stimulm aggregate has been conditioned to a reaction^ 
and if, later, this stimulvs aggregate is presented to the sitbject in 
conjunction with an alien stimulus aggregate not hitherto condi-- 
tioned to the reaction in question^ the strength of the reaction ten-- 
dency evoked by the stimulus combination will be less than will 
that evoked by the stimulus aggregate as originally conditioned. 

This corollary is exemplified in one form of w^hat Pavlov called 
external inhibition, i.e., the form where the ^"extra'^ stimulus w^hich 
produces the external inhibition does not itself evoke a reaction 
conflicting wdth that normally evoked by the conditioned stimulus. 
In this connection Pavlov remarks (6, p. 45) : 

If one experimenter had worked with a dc^ and established mme firm 
and stable conditioned reflex, conducting numerous experiments with them, 
when he handed the animal over to another experimenter to work with, 
ail the reflex^ dmppeared for a consklerable time. The same thii^ 
happen^ when the dc^ was changed over imm one xe^rch rmm to 
another. 

The interpretation is that the change in the stimulus situation 
product by the pr^ence of a new experimenter or by a different 
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rcx 5 m so modified the afferent impulses arising from the original 
conditioned stimiilns that its reaction evocation power sank below 
the reaction tlireshold id, p. 78i. 

'Wlm appears to be the same thing but in an even clearer form 
has been reported by Lashley i o, pp. 140-141). Rats were trained 
to jump to a black card bearing a white triangle and not to jump 
to a similar card bearing a white cross of the same area. After 
the animals had attained more than 95 per cent of correct choices, 
new eards were substituted w’hieh contained the same figures, with 
four small figures added. The score of “correct” choices with the 
new cards fell to 90 per cent. The interpretation is that the four 
additional figures so changed the afferent impulses arising from the 
triangle and the cross that the effective habit strength at the com- 
mand of each was appreciably weakened, which naturally decreased 
the accuracy of the discrimination. It would appear that this sec- 
ondary principle (external inhibition) is operating on a very large 
scale in al! the higher organisms including ourselves, wuth whom it 
is often called “distraction of the attention.” 

A variant of the situation just considered is that in which two 
stimuli have been conditioned to the same reaction separately and 
then are presented to the organism simultaneously. The interaction 
of the two afferent impulses upon each other when occurring con- 
ciurently may be expected to reduce the effective habit strength of 
each so that the amplitude of the reaction evoked by them jointly 
will be appr^iably less than would result from the simple summa- 
tion of the two habit strengths. Thus suppose that Si and Ss have 
each been conditioned separately to R to the extent of 70 habs, 
and that the afferent interaction effect of each on the other is 
of an extent sufficient to change the effective habit strength of 
Bud 'sJIm froni 70 to 40 habs each. Now, the physiological 
summatiim of tw^o habit strengths of 70 habs each yields 91 habs, 
wtereas the summation of two habit strengths of "40 habs each 
yields 64 habs. Accordingly, the summation mechanism alone 
vomM reduce the effective habit strength of the two stimuli to 
9i habs, and the interaction mechanism would reduce it still fur- 
ther — from 91 hate to 64 habs, which is less than the original value 
of the habit strength commanded by each individual component. 
It is evident that if the interaction effects are sufficiently great they 
Bay completely netitralixe any summative effects otherwise re- 
Mltiiig tmm cOTibining homogeneous conditioned stimuli, and even 
m leactBm tendency than that evoked by either stim- 
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ulus aggregate acting alone. Our fifth corollan^ accordingly is as 
follows: 

V. Ij two stt77iuli separately and equally conditioned to tlhe- same 
reaction are later presented simidtaneonsly, and their afferent neural 
interaction effects are sufficiently great, they will jointly evoke a 
■weaker reaction tendency than woiUd be mediaUd by either one 
of the stimuli acting alone. Corollaiy' Y, it will be observed, con- 
stitutes a limitation on the generality of Corollaries I and II. the 
extent of the limitation depending on the magnitude of the afferent 
interaction involved in the change of stimulus conditions. 

An experimental example of some historical interest illustrates 
this point. Shepard and Fogelsanger 17) had subjects learn paired 

nonsense syllables in the arrangement A >C and B tC. 

Later, syllable A and syllable B w’ere presented together. It was 
found that instead of decreasing the reaction latency of the evo- 
cation of syllable C (which would result from the physiological 
summation of two habit strengths), the joint presentation actually 
increased the ■weighted mean latency from 1135 milliseconds to 
2638 milliseconds, the two latencies yielding a ratio of 1 to 2.323. 

In the light of Corollary V these results present no paradox 
whatever. However, in some quarters they have produced a certain 
amount of confusion in the interpretation of rote-learning phe- 
nomena. For example, Kohler [4, p. 316) remarks of this particular 
experiment, 

If it were not for organization one should expect that, both excitants 
working in the same direction, the syllable associated with them would 
1^ more easily reprcxiuced than a syllable for which there was only one 
excitant- But the contrary was oleerved; it as though some 

inhibition were in the way of reproduction when it was arou^ by two 
excitants. 

According to the present vie-w, no inhibition w^hatever is involved 
in the determination of the Shepard-Fogelsanger results, and the 
unspecified organization or configurational factor is in reality a 
combination of afferent neural interaction and stimulus general- 
ization. 

A sixth corollary arises from a revemal of the situation pre- 
sented by Corollary V. SuppK)se that tw^o equally potent stimulus 
a^egates, Si and St, have been conditioned jointly to the same 
reaction to the extent of 65.1 habs, and that later Si is presented 
to the subject separately. Since (Table 1) 65,1 habs eorres|K)nds 
to ten standard reinforcemaits, simple habit summation dynamics 



220 


PRINCIPLES OF BEHAMOR 


m-oiild frive the equivalent of five reinforcements^ or 40.95 habs. 
Now. the interaction between afferent impulses arising from Si and 
Si will ciiange what otherwise w'ould have been Si to Si, Since 
t!ie reaetion is actually conditioned to Si (rather than to Si), it fol- 
lows that later, when Si is presented separately from Sb and other 
dynamic siimuli, the afferent impulse thus sent into the central 
nervous system will be Si, rather than Sj. But by the stimulus 
generalization gradient. Si will command a weaker effective habit 
strength than will and so will evoke a w’-eaker reaction tendency 
than is proper for a habit strength of 40.95 habs, depending upon 
the amount of afferent interaction w'hich occurred in the original 
conditioning situation. If the interaction effects are of sufficient 
intensity, the effective habit strength at the command of one of 
the stimulus aggregates under the assumed conditions may not be 
greater than half that of the original stimulus compound Si and 
Sg: indeed, it may be even less. Our sixth corollary is accordingly 
formulated as follows: 

VI. If two stimulus aggregates, jointly and equally conditioned 
to a reaction, are later presented separately, the reaction strength 
evokable by each may be less than habit summation dynamics 
would indicate, and in extreme cases may be even less than half 
that of the compound originally conditioned, depending upon the 
Q'fmunt of afferent neural interaction effects in the original condi- 
timiTig sitmtion. It will be noted that Corollary VI also consti- 
tutes a limitation on the generality of Corollaries I and II, the 
extent of the limitation depending on the amount of afferent inter- 
i^tion invoh'ed. 

Presumptive critical illustrations of Corollary VI are found in 
those cases of w’hat Pavlov calls the ^‘overshadowing” of one stim- 
ulus eompm^t in a conditioned stimulus compound by another 
cdBfxmeit 1^, 1 ^. 142-144). For example, in one case a stimulus 
ccisfWKmd made up of a tone and three electric lamps tvas con- 
ditMM^ to the salivary reaction so that the compound wrould evoke 
eight drofB of saliva in 30 seconds, though the lamps acting alone 
evoked no secretion whatever. On the above assumptions the 
lamps might very well have contributed materially to the joint 
reacticm of eight drops, yet ha've been so weakened by the influence 
of affotmt iatemtion under wliich conditioning took place as not 
te tte re^ticn fljr^hold when presented separately. The 
of also demand that the tone wffien pre- 

aloM s^mld have evok^ less than eight drops in SO sec- 
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ends; iinfcrtnnately PavloY does oot give the results of this con- 
trol test. 

STATMARY 

Some writers, in an attempt to simplify the account of the learn- 
ing process, have left the erroneous impression that the conditioned 
stimulus, e.g., as represented by S in the symbol bHm, is a simple 
or singular energy operating on one receptor end organ or, at most, 
on a small number of such end organs of a single sense mode. In 
actual fact an immense number of receptor end organs are involved 
in every conditioning situation, however much it may have been 
simplified by experimental methodolog}’. Each stimulus ''object'' 
represents a very complex aggregate of more or less alternative 
potential stimulations, often extending into numerous receptor 
modes. 

Presumably every receptor which discharges an afferent impulse 
during the conditioning process acquires an increment of habit 
strength at each reinforcement. There is much reason to believe, 
however, that the magnitude of the increments acquired by the 
different receptor organs or receptor-organ aggregates varies 
greatly. Among the factors which are believed to favor the acqui- 
sition of large increments are: the dynamic or changing state of 
the stimulus energy, the intensity of the stimulus energy, the 
nature or "mode'^ of the receptor stimulated, the relative rarity of 
the occurrence of the stimulus energy, and the chance that the 
stimulus energy may previously have been conditioned to some 
steongly emotional reaction. 

Despite the great role that stimulus compounds play in adaptive 
behavior, no imique primary principles have been found to be 
operating. One major principle which seems to be active is that of 
the summation of the habit-strength loading borne by the several 
stimulus elements or aggregates of a stimulus compound: The habit 
strengths borne by the several stimulus components summate to 
form a single effective habit strength which is equal to that which 
would be produced by a number of consecutive standard reinfon^ 
ments equal to the sum of the number of remforcements which 
would be required to produce the separate habit strongths !x)me 
by the several stimulus components. The actiem of the principle 
of habit-strength summation is a^c^iated with a second primary 
principle, the monotonic habit-reaction function. This is that the 
strength of a reaction t^dency, other things equal, is a mono tonic 
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increasing function of the effective strength of the habit, and so of 
the reaetion potential «p. 226 ff.) mediating it (Postulate 15, p. 

344ff.N 

Frcm these principles follows the first corollary, that the 
strength of the reaction tendency evoked by two homogeneous con- 
ditioned stimulus aggregates when acting in a stimulus compound 
is less than the arithmetical sum of the t-wo reaction tendencies 
evokable by the components when acting separately. A second 
corolkiy closely related to the first is that the joint reaction ten- 
dency evoked by two stimulus components is greater than that 
evokable by either component acting separately. Both of these 
coroliari^ find experimental illustration imder certain conditions. 
A third corollary of the same group is to the effect that if a num- 
ber of stimulus aggregates are equally conditioned to a reaction, 
and vaiying numbers of them later simultaneously act on the recep- 
toi^ of the organism, the greater the number of aggregate so acting, 
other things equal, the greater will be the strength of the reaction 
tendency thus evoked. 

A third primary principle which, along with those just con- 
sidered, is believed to be active in determining the dynamics of 
stimulus compounds, is that known as afferent neural interaction. 
This principle is to the effect that when afferent receptor discharges 
c^cur at about the same time, they interact, changing each other 
to vaiy'ing degrees depending upon circumstances not as yet well 
known (Peculate 2). AsscKiiated with the interaction principle is 
the familiar principle of the primary stimulus generalization gradi- 
ent I Postulate 5). 

From these latter primary principles there follow some addi- 
tional corollari^. One of these is: In case a conditioned stimulus 
abrogate k printed in a compound with a second or ^kxtra” 
this stimulus compound, other things equal, will 
evoke a weaker leaction tendency than will the stimulus aggregate 
oiipnally conditioned. An example of the action of this principle 
k found m one type of Pavlovian external inhibition. 

NOTIB 

IkiiMtioa for the Ccmbiimtkn of the Habit loadings Borne by the 
of a Conditioned Sthnulus Compound When 
^ CompoiMits Are Two in Humber 

m dtewed % E^. D. T. Berldm frmn an equival^t of Ihe 
^ boaiie by ^^naits of compemnd 
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conditiooed stimuli, in connection with the combination of two generaibed habit 
tendenci^ evokabie by a particular stimulus compound (i). Recast in a form 
appropriate for the pr^nt context, this equation is: 

rr - H ^ Tf iv>\ 

— sMb "t" {dZ) 

where $i is one afferent element or aggr^ate of a conditioned stimulus compound 
and sa is another such element or aggregate of the same conditioned stimulus 
compound, H is the power of the stimulus, reprinted by Si or s«, when com- 
bined with suitable motivation to evoke the reaction (R), and M is the physio- 
Ic^cal limit of conditioning strength imder the circumstance in which the condl- 
tionii^ occurred. 

It may easily be shown that the above equation is a special case of Day^s 
general equation (30) for combining any number of habit strengths (XII, p. ^30). 


An Example of the Combination of the Habit Strengths of Tw’o Condi- 
tioned Stimulus Elements or Aggr^ates by Cleans of Perkins' Equation 

In connection with Corollaiy V (p. 218) there was occasion to determine the 
combined habit strengths of two conditioned stimulus elements or a^r^;ate 
delivered simultaneously as a stimulus compound, each element by hjTpoth^is 
canying a habit loading of 40 habs. Because the expedition in the main text 
is designed for non-mathematical readers, such combination values are usually 
found by means of a table based on the simple growth function. The same out- 
come could, however, have been secured in all cases by the use of Perkins* equa- 
tion, as was actually done in the case of Corollary V. In that example JiHr = 
40 hald and also m^Hs = 40 hahd. Accordingly, suldtituting th^ values in 
Perkins* equation (32), we have, 

. ^ An An 40X40 

40 "p 40 -* " 


80 


1600 
100 

= 80-16 
+ = 64 


The Difficulty of Applying the Habit-Summation Equations in Quantita- 
tive lietaii to Concrete Behavior Situations 

A minor difficulty in the way of appMng the habit summation equations to 
concrete behavior situations lies in the fact (p. 253 ff.) that habit must be com- 
fcaned with a drive (D) or motivation before the stimulus can evoke a reaction. 
If, however, both habit strength and reaction-evocation potential are on a eenti- 
gra<^ scale, and the drive is eho^n in ®ich a way that its function wl^ comlnifeed 
multiplic^tively with that of s3b is unity, no serious difficult will arise, 
it wiH not be po^ble to take up tiie matter of motivation until a later chapter, 
TiA thing has b^n said alxMit this, lest the rmder be uni^cmarily confu^d. 

A major difficulty ties in tte fact that a lar^ number of the stimulus elements 
always present in any conditioning titimtion can never be under direct and rmdy 
contrerf of tl^ inv^tigator. As a result can neiti^ be administered 
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mtelf fmm th(m deiilierately empioj’ed by the investigator (and usually of 
pnimxy interest to Mm) nor wholly eliminated from experimental situations in 
which the experimenter seeks to determine the habit strength of certain stimulus 
such as the light or the vibrator employed in the galvanic conditioning 
n discussed above. If we let the aggregate habit loading of th^ 
n urrcontroEable factors represented by x, then instead of saying that 

the g.dvarJe reaction evoked by the light alone averaged 2.2 millimeters, we should 
sav tliut the action evoked by the light together with x unspecified and unmeasured 
ekmmli in the cmditiork&i giimidus situcUion averaged 2.2 millimeters. In a similar 
manner, we should ^y that the conditioned galvanic skin reaction evoked by the 
joint action of the light, the vibrator, and x unspedjied and unmeasured stimulus 
ekffmiU averaged o.l millimeters. It accordingly comes about that in the 

forsmla the :c-value appears twice in the denominator but only once in 

Ri 

tM numenator, thm giving a smaller mdue to the ratio than it properly should have. 
It mi^t be ^ppc^ed that the habit loading of these x stimulus elements 
wold calculated by mmns of Day’s equation (XII, p. 200 ). As a matter of 
imt this probably wcwld be pc^ible if we knew the value of the constant, M. 
Oik the other hand, the value of M could be calculated if we only knew the habit 
kmding of the x stimulus elements. Perhaps the most promising possibility of 
«sping from this dilemma in the determination of the value of the constant 
M by some quite independent procedure. Meanwhile a considerable number of 
qu^-qu^titative but empirically tetable theorems may be derived from the 
equations even umkr present conditions. Three of these have been outlined 
as eordkries basKi mainly on the h3rpothesis upon which both equations 
ckpend. 

Finally, eonj^rtoe may be ap^nded that possibly these x stimulus 
Aments, such as tte cutanemis stimulations normally resulting from the contact 
of the with its support and the multitude of proprioceptive stimulus 

rmilting fmmi tte posture of the organism, possess a relatively weak 
cavity for acquiring conditioned habit loadings owing to the fact that since 
they are nmre or ubiquitous they must become conditioned in every condi- 
tioning situation the oii^nism encounters. Unless these situations become very 
Mghly patterned on the stimulus side, it follows that stimulus elements wMch are 
coriditios^ to all kinds of reactiom would ultimately become permanently 
^infukhed and thus finally immui^ to any further conditioning. This of cour^ 
wotiH bold im immi or customary posture of the organism, but not for rare or 
b grmt need for experimental research in this field,. 
l*jt ttm pislteii m a difficult one. 


REFERENOEB 

L fioj.. C. L. problem of stimulus equivalen<^ in behavior theory. 

Pfpckal. Mev^ 1 ^, 

X fimx, G. L, &piori.tioaa in the pmtteming of stimuli conditioned to the 
^ GBM. /. IMI, f7, 95-110. 

X G, L. Gc^dit»ciring : Outlii^ of a syi^ematic theory of learning. 

H la of Imtmmg (fmty-first yearbook, Ha- 

fm the St^y of Education). Bloomington, III.: Public 

Co., 



COMPOUND CONDITIONED STIMULI 


225 


4. Kohlsj, W. Gestalt psychology. Xc-w York: Liveright, 1029. 

5. L.^shley, K. S. The mechanism of vision: XV. Preliminary studies of 

the rat's capacity for detail vision. J. Gen. Psychol, 1938, IS, 123-193. 

6. Pa%xov, I. P. Co\dit:Q?:£d reflexes (trans. by G. V. Anrep). Loncloa: 

Oxford Univ. Press, 1927. 

7. Shep.^, J. P-f and Fcxselsangee, H. M. Studies in as^ciation and inhibi- 

tion. Psychol, Eev., 1913, 20j 290-311. 



CHAPTER XIV 


Primal)’ Motivation and Reaction Potential 

It may be recalled that when the problem of primary rein- 
forcement was under consideration (p. 68 ff.), the matter of or- 
pnic need played a critical part in that the reduction of the need 
constituted the essential element in the process whereby the reac- 
tion was conditioned to new stimuli. We must now note that the 
state of an organism’s needs also plays an important role in the 
causal determination of which of the many habits possessed by an 
organism shall function at a given moment. It is a matter of com- 
mon obseiwation that, as a rule, when an organism is in need of 
food only those acts appropriate to the securing of food will be 
evoked, whereas when it is in need of water, only those acts appro- 
priate to the securing of water will be evoked, when a sexual hor- 
mone is dominant only those acts appropriate to reproductive 
activity mil be evoked, and so on. Moreover, the extent or inten- 
sity of the need determines in large measure the vigor and persist- 
ence of the activity in question. 

By common usage the initiation of’Ieamed, or habitual, pat- 
terns of movement or behavior is called motivation. The evocation 
of action in relation to secondary reinforcing stimuli or incentives 
will be called secondary motivation; a brief discussion of incentives 
was given above fp. 131) in connection with the general subject of 
amount of reinforcement. The evocation of action in relation to 
primarv- needs will be called primary motivation; this is the subject 
of the present chapter. 

THl EMPIBICAIi BOIES OF HABIT SIBBNGTH ANB DRIVE IN THE 
DETERMINATION OF ACTION 

Ca^al observations such as those cited above often give us 
valuable clues concerning behav’ior problems, but for precise solu- 
tions. controlled quantitative experiments usually are necessary. 
In the present ccmtext we are fortunate in having an excellent em- 
pirical «udy which shows the functional dependence of the per- 
of food-s«^ng behavior jointly on (1) the number of 
leinfwmMats of the habit in question, and (2) the number of 
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of food privation. Perin {r2i and Williams (fOi trained 
albino rats on a simple bar-pressing habit of the Skinner type 
(p. 87 j 5 ghing separate groups different numbers of reinforee- 
ments var\dng from 5 to 90 under a standard 23 hours^ hunger. 
Later the groups were subdivided and subjected to experimental 



PiQ. 4 S. Column diagram of the Perin-Wiiliams data showing quantita- 
tively how the resistance to experimental extinction in albino rats varks 
jointly with the number of reinforcements and the numba* of hours of fcKKi 
privation at the time the extinction occurred. The cro^hatched columns 
represent the groups of animals reported by Willianis iB)); the non-hatch^ 
columns represent the groups reported by Perin. (Figure reproduced from 
Perin, IS, p. 106.) 

extinction ^ with the amount of food privation varying from 3 to 
22 hours. 

Ttie gross outcome of this experiment is shown in Figure 48, 
where the height of each column represent the relative mean num- 
hev of imreinforced reactions performed by each group l^fore ex- 
perimental extinction yielded a five-minute pause between succes- 
sive bar pr^ures. The positions of the twelve columns on the 
ba^ shows clearly the number of reinforcanente and the number 

^ For an aix^unt of experimenbd extincticm, see pp. 258 ff. 
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Wm. Graphic representation of the data showing the systematic rela- 
tloB^p l^tween the r^istance to experimental extinction (circles) and the 
nninl^r of honrs" food privation where the number of reinforcements is 
mnstant at 16. The smooth curve drawn through the sequence of circles 
repre^nts the slightly pc^itively accelerated function fitted to them. This 
function is believed to hold only up to the number of hours of hunger em- 
ployed in the origisal habit formation process: in the present case, 23. (Fig- 
ure a^pted from Perin, 12, p. 164.) 



GrafJiic repr^ntation of the two ‘learning” curv^ of Figure 
in ttie SUM p^e ta facHitale compari^n. The solid circle repre^t 

to the heighte of the relevant columns 
M mm fesficwr dirdte represents a ^ghtly intepolated value. 

drawn anaoi^ each set of drete reprint the simple 
to meh set of eznphic^ cbta. (Figure adapted from 

m, p. mu 
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of boure' food privation which producal each. It is evident from 
an examination of this figure that both the number of reinforce- 
ments and the number of hours of food privation are potent factors 
in determining resistance to experimental extinction. Moreover, 
it is clear that for any given amount of food privation, e.g., 3 or 22 
hours, the different numbers of reinforcements yield a close approxi- 
mation to a typical positive groinh function. On the other hand, 



Fio. 51. Three-dimensioiial graph representing the fitted mire^ 

^xsnding quantitatively to the action of the number of reinforcemente and the 
number of hours of food privation following ^tiation, in the joint determina- 
tion of the number of unreinforced acts of the originally ©Dtndition^ 
which are required to produce a given degree of experimental extinction. 
(Figure adapted from Perin, IB, p. KB.) 

it is equally clear that for a given number of reinforcements, e.g., 
16, the number of hours of food privation has an almcBt linear 
fxmctional relationship to the resistance to experimental extinctioii. 

For a more precis analysis of thes 5 functi<mal relationships it 
is nee^sary to fit two-dimensional curves to iie data. The results 
of this procedure are presented in Figure 49 and Figure 49 
shows that r^istance to extinction at the 16-reinforcement level is 
a slightly positively accelerate function of the number of hours^ 
focxi privation for the fimt 22 hours. Figure M) shows that a pcm- 
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live growth function fits both ‘'ieaming’' cur\^es fairly well. An 
examination of the equations which generat-ed these curves reveals 
that the asvmptotes differ radically, clearly being increasing func- 
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Tm. 52. Graph ^hcwin? the relation- 
ship cf the I'iCtioD potentiality as a 
function of the length of food priva- 
tion folbwing satiation. Fiist note the 
fact that there is an appreciable 
ainoum of action potentiality at the 
beginning of this graph, where the 
amount of fcK>d privation is zero. Next, 
ol»n'e that the enrv’e is relatively 
high at one day of focwi privation, 
which was the degree of drive under 
which the original training occurred. 
Finally, note that the rise in action 
is fairly continuous up to 
£vr days, after which it falls 
rather sharply. This fall is evidently 
dee to exhaustion, as the animals died 
mmm after. Hie function plotted as 
th^ snM^th curve of Figure 49 corre- 
‘sponds only to the first i^ction of the 
pr€-a«nt friph and el«irly dc^s not rep- 
rint the functional relation^p be- 
ycnd a point where the numW of 
feeure- cf fcMx! pri^^aiicm m greater than 
C Figure repitMiu^d from Skinner, 
m p. m.} 


lions of the number of hours of 
food privation, but that the 
rates at wdiich the curves ap- 
proach their respective asymp- 
totes are practically identical 
(F equals approximately 1/25 
in both cases) . Finally it may 
be noted that both curves, 
when extrapolated backward to 
where the number of reinforce- 
ments would equal zero, yield a 
negative number of extinctive 
reactions amounting to approx- 
imately four. This presumably 
is a phenomenon of the reac- 
tion threshold which will be 
discussed in some detail later 
(p. 322) ; it is believed tq mean 
that a habit strength sufficient 
to resist four extinction reac- 
tions is necessary before reac- 
tion will be evoked by the 
stimuli involved. 

For a final examination of 
the outcome of the experiment 
as a whole, the curves shown in 
Figures 49 and 50 were syn- 
thesized in such a way as to 
yield a surface fitted to the 
tops of all the columns of Fig- 
ure 48. This surface is shown 
in Figure 51. An examination 
of this figure reveals the impor- 
tant additional fact that when 


the wrface m citea|M)latrf to where the number of hours’ focKi pri- 
is ^10, tte r^istance to exf^mental extinction presumably 
wll rtifl show s pi»live gtwto function with n-values of consid- 
As a matter of fact, the asymptote of the growth 
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function where A = 0 ( satiation t 25 28 per cent of that where h = 
22 hours. 

These last results are in fairly good agreement with comparable 
values from several other experimental studies, ileasurements of 
one of Skinner's published graphs, reproduced as Figure 52, indicate 
that his animals displayed approximately 17 per cent as much food- 
seeking acthity at satiation as at 25 hours' food privation. Finch 
1^,1 has shown that at satiation a conditioned salivary reaction in 
nine dogs yielded a mean of 24 per cent as much secretion as wms 
yielded at 24 hours' food privation. Similarly, Zener i£2i reports 
that the mean salivaiy secretion from four dogs average at satia- 
tion 24 per cent as much as at from 21 to 24 hours' food privation. 
The considerable amounts of responsiveness to the impact of con- 
ditioned stimuli w'hen the organism is in a state of food satiation 
may accordingly be considered as well established. 

The continued sexual acti\dty of male rats for some months 
after castration points in the same direction. Stone (15\ imports 
that male rats which have copulated either shortly before or shortly 
after castration, when an adequate supply of hormone wmuld be 
present, continue to show sexual behavior sometimes as long as 
seven or eight months after removal of the testes- According to 
Moore et al. {10), Stone {15), and Beach il ), this operation re- 
moves within 20 days not only the source of testosterone but, 
through the resulting atrophy of accessoiy glands, also the source 
of other specifically supporting secretions. A few^ w’eeks after cas- 
tration, therefore, when the normal supply of sex hormones in the 
animal’s body has been exhausted, the sex drive is presumably in 
about the same state as is the focd drive after complete food 
satiation. The continued sexual activity of these animals thus pre- 
sents a striking analogy to the continued operation of the food- 
release bar by Perin’s rats after fcK>d satiation. While not abso- 
lutely convincing, this e\ddence from the field of sexual behavior 
suggests that the performance of learned reactions to moderate 
degrees in the absence of the specific drive involved in their origi- 
nal acquisition may be sufficiently gena*al to apply to all primary 
motivational situations. 

Closely relate to this same aspect of Perm's investigation is 
a study reported by Elliott (S), Albino rats were in a maze 

under a thirst drive with water as the reinforcing agent until the 
tale path was nearly learned, when the drive was suddenly shifted 
to hunger and the reinforcing agent to focKl. The outcome of this 
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procedure is shown in Figure 53. There it may be seen that on the 
first trial under the changed condition of drive there was an appre- 
ciable disturbance of the beha\-ior in the form of an increase in 
locomotor time; there was also an increase, of about the same 
proportion, in blind-alley entrances. On the later trials, however, 


»Qi 


Control 

j Experimental 

t 



Fis. 5S. Graplia dbowin^ the disruptive influence on a maze habit set up 
m albino rats on the Imins of a water reinforcement, of having the drive (on 
tke tmtb mMWenly dbifted from thirst to hunger. (Reproduced from 

f # p. 

tte leamiag proc^ appeared to proceed much as if no change 
had been made in the experimmtal conditions. 

As a final item in this ^ries there may be mentioned an em- 
pirical o^riratiiM of Pavlov concerning, the effect on an extin- 
guktei ©fflidiriem^ :rea€ti<m of incre^ng the drive. On the anal- 
of Perin^ aperimentj it might be expected that this would 
^ wmaUrn evocable by iJie sMmulus; and this in 
pk«- In tiyb t^nme^on Pavlov ranarks (II, p. 127)j. 
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To illustrate this last condition 
we may take instances of differen- 
tial inhibitions established on the 
basis of an alimentarj- reflex. If, 
for example, the dog has been 
kept entirely without food for a 
much longer period than usual be- 
fore the experiment is conducted, 
the increase in excitability of the 
whole aiimentar}" nerv'ous mech- 
anism renders the previously es- 
tablished differential inhibition 
wholly inadequate. 

EMPIRICAL DIFFERENTIAL 

REACTIONS TO IDENTICAL 
EXTERNAL ENVIRONMENTAL 
SITUATIONS ON THE BASIS 
OF DISTINCT DEIXTS 

A second important type 
of motivational problem was 
broached in an experiment re- 
ported by Hull (6), Albino rats 
w’ere trained in the rectangular 
maze shown in Figure 54. On 
some days a given animal 
would be run in the maze 
when satiated with water, but 
with 23 hours’ food privation, 
whereas on other days the same 
rat would be run when satiated 
with food but with 23 hours of 
water privation. The two types 
of days alternated according to 
a predetermined irregularity. 
On the food-privation days the 
reinforcement chamber always 
contained fcxxi and the left m- 
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Fig. 54. Diagram of the maze em- 
ployed in Hull’s differential drive ex- 
periment. iSi= starting chaml^r; (J 
= food chamber; D\ = dmrs man- 
ipulated by cords from the experiment- 
er’s stand; B', S" ^ barriem acro^ pas- 
sageway, one of which was aiwa^^ 
clo^. The coui^ puisued by a typ- 
ical rat on a false” run is shown by 
the sinuous dotted line. Note that die 
animal went down tii^ “wrong” dde of 
the far enou^ to ^ the dc^i 
d<x>r at B' and then turned around. 
(Repixxliio^ from HuU, d.) 


trance, say, to the chamber w^ blocked so that acc^ could be had 
only by traversing the right-hand side of the rectangle. On the 
water-privation days the reinforcement chamW always contained 
water, and the light-hand Kitrance to the reinforcement chamter 
would be blocks m that aec^ to the water could be had only by 
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traversing tlie left-hand side of the rectangle. The outcome of this 
experiment is shown in Figure 55. There it may be seen that while 
learning was very slow, the animals of the experimental group 
vr:. luallv attained a considerable power of making the reaction 
whicli corresponded to the drive dominant at the time. 

The capacity of rats to learn this type of discrimination was 
later demonstrated more strikingly by Leeper (8) , in a substantially 
similar investigation. ‘ keeper’s experiment differed, however, in the 
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Fia. 55. Composite graphs showing the per cent of correct choices at the 
ir^ trial of each experimental day in the discrimination by rats between 
hinder and thiist motivation {6, p. 263). 


detail that two distinct reinforcement chambers were employed and 
no were blocked at any time, so that if on a ‘ffood’' 

day rat weit to the water side he always found water, and if 
em a day he went to the food side, he always foimd food. 

Under ojnditions the animals learned to perform the motiva- 
ticHial di^iimination with great facility; keeper’s animals needed 
only ateut one-twelfth the number of trials required by the origi- 
ma! Hull tohnique, though again the proce^ of acquisition was 

m attributed m jmrt to the operation of ^mtial 
in fait to Unb that when rats are deprived of either 

imA m As Kjt a xbocmal amount of the oth^ sub- 
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DOIS THE PRINCIPLE OF PRIMARY SHMULIJS-INTENSITy 
geneeaxjzatiok apply to the DRI\’E SHMULES {Sd}? 

A factor with considerable possible significance for the under- 
standing of motivation is the relationship between the degree of 
similarity of the need at the time of reinforcement and that at the 
time of extinction, on the one hand, and the associated resistance 
to experimental extinction on the other. No specific experiments 
have been foimd bearing exactly on this point, but several inciden- 
tal and individually inconclusive bits of e\ddence may be men- 
tioned as indicating the general probabilities of the situation. 

The first of these was reported by Heathers and Arakelian (4). 
Albino rats were trained to secure food pellets by pressing ^ bar 
in a Skinner-Ellson apparatus. Next, half of the animals w’ere par- 
tially extinguished under a weak hunger, and the remainder w^ere 
extinguished to an equal extent imder a strong hunger. Two days 
later the animals were subjected to a second extinction, half of 
each group xmder the same degree of hunger as in the first extinc- 
tion, and the remaining half under a drive equal to the first-extinc- 
tion hunger of the other group. Combining the state of food priva- 
tion of the first and second extinctions, there were thus four hunger- 
extinction groups: 

1, strong-strong; 2, strong-weak; 3, weak-strong; 4, weak-weak. 

The authors report that a statistical pooling of the r^ults from 
the^ four grouj^ of animals revealed a tendency of the rats extin- 
guished twice on the same drive to resist extinction than did 
tiiose animals which were extinguished the second time cm a drive 
different from that employed on the first occasioiL In two inde- 
pendent studio this difference mnounted to approximately 4 and 
6 i:^ c^t resp^tively; the latter results are reported to have a 
probability of 8 in 10 that the difference was not due to chance. 
This experimental outcome is evidently relate to the primary 
^neralisation of stimulus intoamty and miggeste that permvemtive 

ths prevents geniune of tl^ satiated drive. 

For example, thirsty rats ^pp<^dly satiated with food will, after re<^iviiig 
ever a few dhops of water, wery genially if food m available (0, p. 2m) ; 
and rate, like hmnai^ freqi^tly drink while earing dry food if wata: is 
availabie. Thi^ after riie fir^ trial I^eperis animak were presumably oper- 
arii^ mKier both drives, and <me drivB or the other was reinforced no mat- 
tar whHi jmrii was teavarsed. 
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€xtin€thn effects are to some extent specific to the primary drive 
or need intemity under which the extinction occurs. 

Bj' analogy, the stimulus-intensity generalization gradient ap- 
parently found in the case of extinction effects just considered 
strongly suggests the operation of the same principle in the case 
of reinforcement effects. Now, such a gradient has been demon- 
strated experimentally by Hovland (see p. 186 ff.); it naturally 
has its neatest value (Figure 43} at the point of reinforcement 
Consequently it is to be expected that in a curve of motivation 
intensity such as that of Skinner (shown in Figure 52) a special 
elevation or inflection would appear at the drive intensity at which 
the original reinforcement occurred. Whether a mere coincidence 
arising from sampling errors or not, exactly such an inflection may 
te in Skinner's empirical graph at one day of food privation, 
which was in fact the drive employed by Skinner in the training of 
the animals in question. The present set of assumptions implies 
that if Skinner's curve as shown in Figure 52 were to be plotted 
in detail by hours rather than days, it would present a positive 
acceleration from zero to one day of food privation. Now, Perin^s 
study did plot this region in some detail, and Figure 49 shows that 
a pcBitively accelerated fimction was found. These facts still fur- 
ther increase the probability that the principle of stimulus-intensity 
^neralisation appli^ to the drive stimulus (Sp). 

THB INFLUENCE OP CERTAIN DRUGS ON EXPERIMENTAL 
EXTINCTION AND ITS PEBSEVERATIQNAL EFFECTS 

Certain drup are known to influence markedly the phenomena 
of o:|^iimental extinction. Switzer {18) investigated the effect of 
caffeine citrate on the conditioned galvanic skin reaction in human 
mmg a control d(^ of milk sugar. He found that caf- 
f«iie iiicf^»d iMstmce to experimental extinction; incidentally 
he fouiMi that caffeine increased the amplitude of the uncon- 
ditifliied galvamc skin reaction and decreased the reaction latency. 

Pavlov (II, p. 127) reported a somewhat related experiment 
j^rfonc^ by Hikiforovsky. An alimentary salivary conditioned 
refei had l^a up to a tactile stimulus on a dog^s forepaw. 
T^i mmUtm teidoicy ^leralized to other parts of the animal^s 
Am, includiiif a fxsnt cm the back which subsequently was com- 
dMnpistei At latta: st^ of training the stimulus 

the yidekd flve dreps of i^va during the first minute of 
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stimulation, whereas stimulation of the extinguished spot on the 
back yielded a zero reaction. Thereupon, the animal was given a 
subcutaneous injection of 10 c.c. of 1 per cent solution of caffeine. 
A few minutes later the stimulus when applied to the forepaw 
evoked four drops during the first minute, and when applied to 
the previously extinguished spot on the back, yielded three drops 
(il, p. 128), thus indicating a major dissipation of the extinction 
effects. 

Miller and Miles ( 9 ) have contributed to this field. They 
demonstrated in albino rats traversing a 25-foot straight, enclosed 
runway that an injection of caffeine sodio-benzoate reduced the 
locomotor retardation due to experimental extinction by about 
two-thirds. In the same study it "was shown that the retardation 
in locomotor time due to satiation was reduced by the caffeine 
solution approximately one-half (P). 

Benzedrine is another substance 'which when thrown into the 
blood stream has the power of greatly retarding the onset of experi- 
mental extinction. This was demonstrated by Skinner and Heron 
(14) to hold for the Skinner bar-pressing habit. 

SEX HORMONES AND REPBODUCnVE ACTIYITT 

As a final set of empirical obseiwations concerning motivation 
we must consider briefly the relation of sex hormones to repro- 
ductive behavior. Within recent years an immense amount of 
excellent experimental work has been performed in this field, though 
only brief notice of it can be taken in this place. An account of 
two typical bits of this work was given above (Figure 11 and 12). 
In a recent comprehensive summary by Beach (1) the follo'wing 
projK>sitions appear to have fairly secure empirical foundation: 

1. Animals of practically all species which through castration have 
bejome sexually unresponsive to ordinary incentive stimulation, become 
r^poiisi\"e promptly on the injection of the appropriate hormone — uai- 
aHy testosterone proprionate for males and ^rc^en for female. 

2 . Presumptively normal male rats differ greatly in their «xiial re- 
s|x^nswene^, all the way from thc^ which will attenpt eoprilation with 
inaiiimate objects to tho^ which will not react even to an extrmdy re- 
ceptive and alluring female, Tte inj^^ion of te^esterone usi&lly 

the rmctivity of ah but a few of the most Muggfeh animals. Alterna- 
tively, the pr^ntation of an esp^ially attmctive incentive tends to 
have the same objective effect, though to a le^r ( 17 ), 

3 . E^roction of the cerebral cortex decrea^ reactivity roughly 
in proportion to the extent of aich <fetniction, very much as occurs in 
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the case of food habits. If destruction has not been too great, injection 
Of the hormone will largely restore sexual responsiveness to appropriate 
meeritives. The presentation of an exceptionally attractive incentive will, 
however, have much the same effect upon the objective behavior of such 

orgiinhms. 

4. Virdn male organisms which are unresponsive to an ordinary re- 
ceptive femalej after a few copulations under the influence of an injec- 
tion of the hormone will remain responsive long after the hormone has 
presiumbly disappeared from the animal’s body. This is believed to be 
caused by the leaniing r^ulting from the incidental reinforcement which 
occurred when the animal was under the influence of the hormone ( 1 ), 

5. Many intact individuals of both sex^ in most species occasionally 
manifest a ponion of the behavior pattern characteristic of the oppo- 
site sex. Injection of the sex hormone of the opposite sex in castrated 
individuais of either sex tends strongly to the evocation of the sexual be- 
havior pattern characteristic of normal organisms of the opposite sex on 
appropriate stimulation; this, however, is not usually as complete as the 
gross anatomical equipment of the oiganisms would seem to permit. Curi- 
ously enough, large doses of testosterone given to male rats make possible 
the elicitation of ail elements of the tv^ical female sexual behavior (1). 

primary motivational concepts 

With the major critical phenomena of primary motivation^ 
now before us, we may proceed to the attempt to formulate a theory 
which will conform to th^e facts. 

At the outset it will be necessary to introduce two notions not 
previously discussed. These new concepts are analogous to that 
of habit strength (sHr) w’hich, it will be recalled (p. 114), is a 
logical construct conceived in the quantitative framework of a cen- 
tigrade sv’stem. 

The first of the two concepts is strength of 'primary drive; this 
is represented by the sjunbol D, The strength-of-drive scale is 
to extend from a zero amount of primary motivation 
leOTplete satiation} to the maximum possible to a standard organ- 
ic of a pven s|^i^ In accordance with the centigrade principle 
this range of primary drive is divided into 100 equal parts or units. 
For convenience and ease of recall, this unit will be called the mote^ 
a contraction of the word motivation with an added e to preserve 
aormal pmnuneiation. 

of the practical exigencies of exposition the second of 

ph&aoiaeiia of groondary motimtion, including such ma^ 
m (p. ISlff,), fmctional anticipatory goal- and subgoal-reac- 

be in volume beimuse space is not available. 
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the new concepts has alreadj" been utilized occasionally in the last 
few pagesj where it has been referred to as the ^'reaction tendency/' 
a term in fairly general use though lacking in precision of meaning. 
For this informal expression we now substitute the more precise 
equivalent, reaction-evocation 'potentiality; or, more briefly^ reac- 
tion potential. This will be represented by the symbol gEs- 
Like habit (sH^) and drive (D), reaction-evocation potential is 
also designed to be measured on a lCK)-point scale extending from 
a zero reaction tendency up to the physiological limit possible to 
a standard organism. The unit of reaction potentiality will be 
called the waf, a contraction of the name Watson, 

It should be evident from the preceding paragraphs that D and 
sEe are symbolic constructs in exactly the same sense as (see 
p. Ill ff.), and that they share both the advantages and disadvan- 
tages of this status. The drive concept, for example, is proposed 
as a common denominator of all primary motivations, whether due 
to food privation, w'ater privation, thermal deviations from the 
optimum, tissue injury, the action of sex hormones, or other causes. 
This means, of course, that drive will be a different function of 
the objective conditions associated -with each primary motivation. 
For example, in the case of hunger the strength of the primary drive 
will probably be mainly a function of the number of hours of food 
privation, say; in the case of sex it will probably be mainly a func- 
tion of the concentration of a particular sex hormone in the ani- 
maFs blood; and so on. Stated formally, 

D = jih) 

D = i{c) 

D = etc., 

where h represents the number of houm of food privation of the 
organism since satiation, and c represents the concentration of a 
particular hormone in the blood of the organism. 

Turning now to the concept of reaction-evocation potentiality, 
we find, thanks to Periods investigation sketched above (p. 227 ff.), 
that we are able at once to define bEm as the product of a function 
of habit strength (sHe) multiplied by a funcMon of the relevant 
drive (D). This multiplicative relationship is one of the greatest 
importance, because it is upon s^b that the amount of action in 
its various forms pr^mably depends. It is clear, for example, 
that it is quite impossible to pmiict the vigor or persistence of a 
given type of action from a knowledge of either habit strength or 
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drive strength alone; this can be predicted only from a knowledge 
of the product of the particular functions of sHr and D respec- 
tively; in fact, this product constitutes the value which we are 
representing by the s}'mbol sEr* 


SUMMABY AND PRELIMINART PHTSIOLOGICAI. INTERPRETATION 
OF EMPIRICAL FINDINGS 

Having the more inaportant concepts of the systematic approach 
of primary motivation before us, we proceed to the formulation of 
^me empirical findings as related to motivation. 

Most, if not all, primary needs appear to generate and throw 
into the blotxl stream more or less characteristic chemical sub- 
stance, or else to withdraw a characteristic substance. These 
substances (or their absence) have a selective physiological effect 
on more or les restricted and characteristic portions of the body 
(e.g., the so-called ^^hunger” contractions of the digestive tract) 
which servTS to activate resident receptors. This receptor activa- 
tion constitutes the drive stimulus, Sr (p. 72 ff.). In the case of 
ti^e injury this sequence seems to be reversed; here the energy 
producing the injury is the drive stimulus, and its action causes 
the release into the hlmd of adrenal secretion which appears to be 
the phj'siologieal motivating substance. 

It s^ms likely, on the basis of various analogies, that, other 
things equal, ihe intensity of the drive stimulus would be some form 
of negatively accelerate increasing function of the concentration 
of the drive substance in the blood. However, for the sake of ex- 
IMBitory simplicity we shall assume in the present preliminary 
that it is an increasing linear function. 

The affeient discharge arising from the drive stimulus (Sd) 
bmmc condition^ to reactions just the same as any other elements 
is ^imuliis ccMnfK^unds, except that they may be somewhat more 
IMiteit in acquiring habit loadings than most stimulus elements or 
drive stimulus may play a role in a con- 
ditioned stimulus ^mjK^und substantially the same as that of any 
rtamulus element or abnegate (p. 74 ff.) . As a stimulus, Sd 
natiirally manife^ both qualitative and intensity primary stimulus 
^neralaalion in comuKui with other stimulus elements or a^re- 
^tm m ^Kiiticrod ^nmilus compounds (p. 185 ff.). 

It ptrtmMe timt when blood which contains certain 

tibfown into it ^ tibe r^ult of stat^ of n^i, 
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or which lacks certain substances as the result of other states of 
need, bathes the neural structures which constitute the anatomical 
bases of habit the conductivity of these structures is aug- 

mented through lowered resistance either in the central neural tissue 
or at the effector end of the connection, or both. The latter type of 
action is equivalent, of course, to a lowering of the reaction 
threshold and would presumably facilitate reaction to neural im- 
pulses reaching the effector from any source whatever. As Beach 
11 1 suggests, it is likely that the selective action of drives on par- 
ticular effector organs in non-leamed forms of behavior acts mainly 
in this manner. It must be noted at once, however, that sensitizing 
a habit structure does not mean that this alone is sufficient to 
evoke the reaction, any more than that caffeine or benzedrine alone 
will evoke reaction. Sensitization merely gives the relevant neural 
tissue, upon the occurrence of an adequate set of receptor dis- 
charges, an augmented facility in routing these impulses to the 
reactions previously conditioned to them or connected by native 
(inherited I growth processes. This implies to a certain extent the 
undifferentiated nature of drive in general, contained in Freud’s 
concept of the ^‘libido,’’ However, it definitely does not presup- 
pose the special dominance of any one drive, such as sex, over the 
other drives. 

While all drives seem to be alike in their powers of sensitizing 
acquired receptor-effector connections, their capacity to call forth 
within the body of the organism characteristic and presumably dis- 
tinctive drive stimuli gives each a considerable measure of distinc- 
tiveness and specificity in the determination of action which, in 
case of necessity, may be sharpened by the prcM^ess of patterning 
(see p. 349 ff.) to almost any extent that the reaction situation 
requires for adequate and consistent reinforcement. In this respect, 
the action of drh^e substances differs sharply from that of a pseudo- 
drive substance such as caffeine, which appears to produce nothing 
corresponding to a drive stimulus. 

Little is knovm concerning the exact quantitative functional 
relationship of drive intensity to the conditions or circumstance 
which produce it, such as the number of hours of hunger or the 
<x>neentration of endocrine secretions in the blocKi, Judging from 
the work of Warden and his associates (19) ^ the relationship of 
the hunger drive up to two or three days of food privation vrould 
be a negatively accelerated increasing function of time, though a 
study by Skinner (Figure 52) suj^ests that it may be nearly linear 
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up to about five days. For the sake of simplicity in the present 
esplorational analysis we shall assume the latter as a first approxi- 
mation. 

Physiological conditions of need^ through their sensitizing action 
on the neural mediating structures lying between the receptors and 
the effectors appear to combine with the latter to evoke 

reactions according to a multiplicative principle, i.e., reaction- 
evocation potentiality is the product of a function of habit strength 
multiplied by a function of the strength of drive; 

sEb=-S{sBb) X/(D). 

In the next section it shall be our task to consider in some detail 
what these functions may be; if successful we shall then possess 
the main portion of a molar theory of primary motivation. 


THE QEA2SrTITATIVE DERIVATIOK OF FROM AHD D 

Since we have taken Perin^s experiment as our main guide in 
the analysis of the primary motivational problem in general, it 
will be convenient to take the need for food as the basis for the 
detailed illustration of the working of the molar theory of motiva- 
tion; this we now proceed to develop. 

Turning first to the habit component of we calculate the 
values of gifs a positive growdh function; vre use in this cal- 
culation the fractional incremental value (F) found by Perin to 
hold for the learning processes represented in Figure 50, -which 
was approximately 1 /25 for each successive reinforcement. On this 
assumption the values at various numbers of reinforcements, e.g., 
0, 1, 3^ % 18, 36, and 72, have been computed. These are shown 
in column 2 of Table 5. 

Tl^ habit-strength values of column 2, Table 5, consist of the 
physiolopeal summation of the habit-strength loadings of the stim- 
idiis cimiponejits, repr^ented by the original drive stimulus 
and the non-drive components, which we shall represent by St, 
AsOTming m a matter of convenience that Sd' and Sx have equal 
Imdinp, the value of each (see fifth terminal note) is easily cal- 
culated for the several numbers of reinforcements. TTiese valu^ 
mm ki -column 3 of Table 5. 

*I\irniiig next to the matter of drive, it will be assumed that 
©lipaal Imming twk place \mder a 24-hour food privation. 
Assinxiing ftirdicr tl»t drive is a Imear function of the number of 



TABLE 5 

Tablb Showing tub Preuminaht Steph in the Derivation of a Series of TnEORKTiCAii REAcrioN-Po'rENTiAi. Valuer from 
A Varied Set of Antecedent Reinforcements Under a Drive op 21) Units’ Strength, the Resulting Habits Being Evaluated 
FOR Reaction Potentiality at Drivk-Strenothh of 0.00, 2.50, 0.667, 13.333, and 20.0 Units. 
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hours’ hunger and that iFigure 52i the maximum of 100 motes 
would be reached at fi%’e days or 120 hours, Perm's periods of food 
privation may be converted into units of drive strength by multi- 
plying the number of hours’ food privation by the fraction 100/120. 
In this way we secure the following drive or D-values: 


Number of hours* food privation (h ) : 

0 

3 

8 

16 

Strength of drive in motes (D) : 

0 

2.5 

6.667 

13.333 

Deriation (d) of possible D's from the 
drive CD*) of origmsl learning: 

20 

17.5 

13.333 

6.667 


Now, S|? is assumed to be approximately a linear function of D. 
It follows from this and the principle of primary stimulus general- 
iiation that action evoked under any other intensity of drive (and 
drive stimulus) than that involved in the original habit formation 
must be subject to primary intensity-stimulus generalization. As- 
suming the relatively Sat gradient yielded by an F- value of 1/50, 
it is easy to calculate the value of (p. 199 ff.) at each degree 
of the five B- values taken above. These bdHs values are shown 
in columns 4, 5, 6, 7, and 8 respectively of Table 5. A glance at 
the tottom entri^ of each of the columns shows that the values 
of fall progr^ively from 46.34 at D = 20 (i.e., <i = 0.00) 
to ^.94 at i) = 0 li.e., d = 20). 

We must now combine these habit values by the process of 
physiological summation characteristic of conditioned stimulus 
eoin|K>imds fp. 2^ff.) (neglecting the effects of afferent interac- 
tion) with the habit loading of the non-drive stimulus component 
of the compound which is represented by the values appearing in 
column 3. The physiological sum m ation of the values in column 3 
with the valu^ of columns 4 to 8 gives us the habit-starength valu^ 
shown in columns 9, 10, 11, 12, and 13 of Table 5. It will be 
that Uiis final recombination of the bHe values w^here 
DznM yields exactly the same values as those of column 2. This 
is tecau^ whm reaction evocation occurs at the original drive 
IjD''), ie., where D = D% no distortion of the Sjy component of the 
habit results, the synth^is being exactly the reverse of the analysis 
which place I^tween columns 2 and 3. 

With tile thaiietieal valu^ of /(sffs) available in col umn r 9 
to IS of Table 5, we may now turn our attention to the 

of /{J». It is a^umed that 2) itself acts upon bHe as a 
dii^ However, there is the complication that other 
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or alien drives active at the time «re|>reseoted in the aggregate by 
the sj^iiibol D) ha%'e the capacity to sensitiie habits not set up in 
conjunction with them. Let it be supposed that this crenf-rallzed 
effect of alien drives adds 10 points to the actual drive throughout 
the present situation. Thus the effective drive \ operative on a 
given habit would necessarily involve the summation of D and D; 
in the case o! the 24-hour food privation a simple summaticMi 
would in the present situation amount to 10 4- 20. or 30. and at 
homrs it would be 10 4- t)r ilO. In order to maintain our 
centigrade system the simple summation must be divided by the 
maximum possible under these assumptions, or 110. Accordingly 
w'e arrive at the formula, 

D = 100 4 ^ 

Z) + 100 

w’here D represents the effective drive actually operative in pro- 
ducing the reaction potential. 

N0W5 assuming that reaction evocation potentiality is essen- 
tially a multiplicative function of habit strength and drive, i.e, 
that, 

sE^^KsHu)Xf{D), 

since /(sHs) is gHsf and fiD) is D, we have by substitution, 

X jy. 

However, since both and D are on a centigrade scale, their 
simple prcKiuct would yield values on a ten-thousand pwint scale; 
therefore, to keep gE^ also to a centigrade scale we write the equa- 
tion, 

1? sHr X D 
^ 100 


Substituting the equivalent of D and simplifying, we have as our 
final equation, 


F _ ff D + D 

- ^^6 + 100 


The second portion of this formula, with the various D valura sub- 
Btituted, is, 


10 4 - 2 . 5 . 10 + 6 . 667 . 

110 110 


10 + 13 . 333 . 
110 


10 + a) 
110 


.0009 


.1136 


.1515 


.2121 


. 2727 . 
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The values of sEjc are accordingly obtained simply by multiply- 
ing the several entri^ of column 9 by .0909, those of column 10 
bv .1136, and so on. These products are presented in detail in 
columns 14, 15, 16, 17 and 18 of Table 5, -which are the values we 
been seeking; they are shown diagrammatically by the curved 
surface of Figure 56. A comparison of the theoretical values of 
Figure 56 with the surface fitted to the empirical values represented 



Fig. GnpHie representation of the theoretical joint determination of 
rmttmji jK>t€atial by various numbers of reinforcements under a drive (D') 
of M units Mrength when functioning under drives (D) of various strengths 
than that of the original habit formation. Note the detailed agreement 
with the cxMHimrable emphiiml results shown in Figure 51. 


by Ae Girtlm m Figure 51 indicates that the theoretical derivations 
appi^xiaate the facts very closely indeed. 

Compilations analogous to the preceding have shown that the 
pr^nt set of jmstulates and constants also hold when D > D' at 
Imst up to three days of food privation. The theoretical curve 
for all valum of D between 0 and 72 hours yields a positively 
jwitaatial up to 24 houis (D' in the present 
wh^ ttiere is a sli^t inflection; as D increases above 
l>^ tb^ is at flnrt a brief period of positive acceleration, which is 
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followed by a protracted period ilm is nearly linearj tbe whole 
showing a fair approximation to Figure 52, 

Generalizing from Table 5 and Figure 56, the following eorol- 
laries may be formulated as a kind of condensed surnmniy of 
the implications of the present set of assumptions as shown by the 
preceding computations: 

L Wh£n habit strength is zerOj reaction-evocation polenfAal i$ 
zero, 

11, When primary drive strength \Ds is zero^ reaction-evocation 
potential has an appreciable but relatively low posiili e value 

which is a positive grov:ih function of the number of reinforce^ 
menis. Corollaries I and II both agree in detail with Perin s em- 
pirical findings. 

As the drive (D i increases from zero to D': 

IIL The reaction-evocation potential increases with a slight 
positwe acceleration. 

IV. The reaction-evocation potential maintains its ^sitive 
growth relationship to the number of reinforcements. Both of these 
corollaries agree in detail with Perin’s empirical findings. 

As the drive (Dl increases above D': 

V. There is a definite inflection in the bEr function at D, the 
slope for values of D just greater than D' being less than for those 
just below. 

VI. The reaction-evocation potential above D' increases at first 
with a slight positive acceleration^ which soon gives place to a prac- 
tically linear relationship. Both of these corollaries agree in detail 
with Skinner's empirical finding (Figure 54). 

MlSCmiJANEOUS COROLIAEIES FLOWING FROM THE PRESENT 
PRIMARY MOTIVATION HYPOTHESIS 

The first problem in this series is that presented by Elliott’s 
experiment described above, the outcome of which is clearly shown 
in Figure 53. Here we have the case of a reaction tendency set up 
on the basis of one drive, showing a partial but by no means com- 
plete disruption w^hen this drive (thirst) is abruptly replaced by 
another drive, that of hunger. At this point we r^all the assump- 
tion stated earlier (p. 241) that all drives alike are able to sensitize 
all habits. Applying this to the behavior of Elliott^s animals dur- 
ing the critical trial on the maze after the change in 
drive, it is to be exp^ted that while hunger was then the dominant 
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drive, certain residual amounts of various other drives (including 
thirst I were also active. These in the aggregate (£)), the hunger 
drive ineluded, are presumed to operate in a multiplicative manner 
upon the habit strength effective at the moment in determining 
reaction potentiality. It is assumed that this would be enough to 
e%-ok€% on the average, about 20 per cent as much activity as is 
evoked by the thirst drive. 

This means that the residual drive (D) must amount to con- 
siderably more than 20 per cent of the regular thirst drive, say. 
For example in the detailed analysis of the preceding section, where 
24 hours^ hunger stood at 20 units of drive, this residual drive was 
placed at 10 units, which is 50 per cent as much as 20. Neverthe- 
less, the reaction potential at 24 hours’ hunger came out at 19.42 
units, whereas that at satiation or zero drive stood at 4.58 units, 
the latter being only about 23 per cent of the former. The ex- 
planation of the paradoxical difference of 50 versus 23 per cent is 
significant; it arises largely from the fact that when the hitherto 
dominant drive ceases to be active, not only are there lost the 
20 units of drive strength previously contributed by this need, but 
there is also lost to the conditioned stimidm compcnind the sizable 
component made up by Sd, the withdrawal of which materially 
reduces the available habit strength associated with the situation in 
qmstion, and so reduces the resulting reaction potential. 

On the basis of the above analysis we may formulate the fol- 
lowing additions to the corollaries listed in the preceding section: 

VIL Under the conditions of the satiation of the dominant drive 
imvolved in the original habit-acquisition, there are sufficient re- 
siduals of other drives which in the aggregate yield on the average 
am exdiat(nry jH>tential amowiting to around BO per cent of that 
md^ilized by a 2^-lmwr hunger on a habit originally set up on the 
bam of this drive. 

VIII. In cam an orgarmm is presented with all the stimuli char- 
mteristic of a habit, if the origmal drive is replaced by a strong 
m&md drive whose Sb activates no conflicting habit tendency, the 
remctim pofenfial to the execution of the habitual act will be 
s^mget than would be the case if the irrelevant second drive were 
mot mctim. This means that if a control group with both hunger 
«id iMisI thoreu^ly satiated were to be added to the Elliott ex- 
above, the mean retardation in the running time 
imd the mean ntmte of eirors would increase appreciably above 
fwm a mere refjlac^ent of one drive by another (d). 
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A second problem concerns the relation of the experiiaental ex- 
tinction of a reaeticn tendency to the drive intensity operative at 
the time of extinction. Now, the passage from Pavlov quoted above 
Ip. 233 J strongly suggests that experimental extinction effects are 
in same sense directly opposed to reaction poteniial rather than 
merely to habit strength. Since with a constant habit strength an 
increase in the drive augments the reaction potentiah and since 
extinction effects are an increasing function of the number of unre- 
inforced evocationSj it follows that: 

IX. The number of reinforcements being constant, the stronger 
the relevant drive, the greater will be the number of unreinforced 
evocations which will be required to reduce the reaction potential 
to a given level. 

X. The number of reinforcements being constant, the stronger 
an allied bid irrelevant drive active at the time of extinction, the 
greater will be the number of unreinforced evocations required to 
reduce the reaction potential to a given level, though this number 
will be materially less than would be required under the same intent 
sity of the relevant drive. 

Tims if a habit set up on the basis of a thirst drive tvere extin- 
guished under a sizable hunger drive but with water satiation, the 
theory demands that the reaction potential would extinguish with 
few-er unreinforced evocations than w’ould be the case under the 
same intensity of the thirst drive in conjunction with a zero hunger 
drive; moreover, such a habit would require more unreinforced 
reaction evocations to produce a given degree of extinction under 
a strong hunger drive than under a weak one. By the same tyf^ 
of reasoning it is to be expected that if a reaction tendency were 
set up in male rats under hunger or thirst-, and if subsequently a 
random sample of the organisms w^ere castrated, experimental ex- 
tinction under a normal hunger dri^^e w’ould occur more quickly 
than it would in the non-castrated organisms. 

At this point w'e turn to a more detailed consideration of Pav- 
lov’s observation just referred to, that wrhen he had perform^ an 
experimental extinction under a given drive and then increa^ 
the drive, the conditioned stimulus would a^n evoke the reaction. 
This may be deduced rather simply: If a certain number of unreio- 
forced evcKiations of a reaction have prcxiuc«l sufficient extinction 
effects to neutralize a given amount of ^citatory potential, an 
increase in the drive will incre^ the excitatory potential which 
the existait extinction effects will no longer suflfce to neutralize 
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completely. The balance of the reaction potential will accordingly 
be availaMe to evoke reaction and, upon adequate stimulation, will 
do so. We thus come to our eleventh corollary: 

XL If a reaciim tendency is extinguished by massed reaction 
evocations under a given strength of drive, and if at once there- 
after the drive is appreciably increased, the original stimulation 
uiil again evoke the reactian. 

Our final question concerns an exceedingly important problem 
in adaptive dynamics. It has already been pointed out that as a 
rule action sequences required to satisfy a food need are different 
from those require to satisfy a water need, and both would ordi- 
narily q\iite different from the acts which would be required to 
satisfy a sex drive. This problem is posed very sharply when, as 
in the Hull-Leeper experiments, an organism is presented with an 
identical objective situation and required to make a differential 
ruction purely on the basis of the need dominant at the moment. 
These experimeits confirm everyday observations that animals can 
adapt succ^fully to ^ch situations. The question before us is 
how this behavior is to be explained. 

At first si^t it might be supposed that in this situation the 
animals would merely a^ociate & with turning to the right, say, 
and St with turning to the left, and that adaptation would thereby 
be complete. A little further reflection will show, however, that 
this simple explanation is hardly adequate, because if there were 
really an independent and functionally potent receptor-effector con- 
nection between the hunger-drive stimulus and turning to the right 
the animal w'ould, when hungry, be impelled to turn to the right 
continuously when in its ca^ or wherever it happened to be, as well 
m at the choice point in the maze. The animals, of course, display 
no meh behavior, any more than we ourselves do. 

The pr^mt set of p<^tulates mediates the explanation chiefly 
ce the b^is of a secondary process known as patterning. Unfor- 
tunately it will not jmsible to ^ve an exposition of this exceed- 
ingly import^t subject until a later chapter (see p. 349 ff.). How- 
finding the detailed presentation in that place we shall here 
merely indicate dogmatically the nature of jmtteming and briefly 
^etch application of this ^ondary principle to the problem in 
adai^ive dynamics now before us. 

By term ^^tteming” we mean the process whereby organ- 
e^)acitj of reacting (or not reacting) to partic- 
niar of stimuli as distinguished from the several com- 
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ponent stimulus elements or aggregates making up the compound. 
At bottom tins process turns out to be a ease of learning to dis- 
criminate afferent interaetion effects qu 42 11.1. Spcfififeilhu the 
principle of afferent interaction implies that in the IIuibLee|>er 
studies afferent impulses arising from the environmeiital stimuli 
are somewhat diSorent when stimulation occurs in fomhina- 
tion with the hunger-drive stimulus \Shi from those which result 
from the same stimulation in combination with tiie thirst-drive 
stimulus t'SsC Similarly, the afferent impulses arising from Si i%iid 
St are somewhat different when initiated in conjunction with Si 
from those initiated by S* and St in the cage or other situations. If 
the afferent impulses arising from the environmental stiriiiili un- 
complicated by any panicular drive be represented by ,s, then these 
impulses when modified by the interaction with the hunger-drive 
stimulus may 1:^ represented by Sh, and when modified by inU^ac- 
tion with the thirst-drive stimulus, by st. Since there are but two 
alternatives, it is to be expected that at the outset of training, reac- 
tion wT3uld be about 50 per cent correct. However, as the differen- 
tial reinforcement yielded by the techniques employed in these 
investigations continues, the gradient of generalization between if 
and $i would progressively steepen, as shown in Figure 60 ; i.e., 
discrimination learning w’ould gradually take place, exactly as it 
does in fact. Ilius w'e arrive at our twelfth corollaiy": 

XII. Organisms v:ill leam to react differentially to a given 06- 
jective situntion according to the drive active at the time, and to 
react differentially to a given drive according to the objective dtwa- 
turn at the time. 


SUMMARY 

Hie needs of organisms oj^rate both in the formation of habits 
and in their subsequent fxmctioning, i.e., in primary motivation. 
Because of the sensitizing or energizing action of needs in this latter 
role, they are called drives. 

A great mass of significant empirical evidence concerning pri- 
maiy motivation has become ai^ailable within recent yeare. A sur- 
%"ey of this material, particularly as related to hun^r, thirst, injury 
(including the action of very intense stimuli of all kinds), sex, and 
the action of certain sut^tances such m caffeine, has led to the 
tentative -conclusion that all primary driv^ pitxluce their effects 
by tibe action of various chemicals in the blocKi. Substances like 
caffeine, through bathing the neural mechanisms involved, seem to 
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operate by heightening the reaction potential mediated by all posi- 
tive habit tendencies. Drive substances, such as the various endo- 
crine secretions, are conceived either to be released into the blood 
by certain kinds of strong stimulation or as themselves initiating 
stimulation of resident receptors through their evocation of action 
by selected portions of the body, e.g., the intestinal tract and the 
genitalia. In both cases the energy effecting this receptor activa- 
tion is called the drive stimulus (Sjd). 

The action of these endocrine substances, while apparently low- 
ering the reaction threshold of certain restricted effectors (i, p. 
184 ff.), seems al^ to have a generalized but possibly weaker ten- 
dency to facilitate action of all effectors, giving rise to a degree of 
undifferentiated motivation analogous to the Freudian libido. Thus 
a sex hormone would tend to motivate action based on any habit, 
however remote the action from that involved in actual copulation. 
This, together with the assumption that one or more other motiva- 
tions are active to some degree, explains the continued but limited 
amount of habitual action of organisms when the motivation on the 
basis of which the habit was originally set up has presumably be- 
come zero. It also suggests a possible mechanism underlying the 
Freudian concept of sublimation. However, where differential be- 
havior is required to bring about reduction in two or more drives, 
the differences in the drive stimuli characteristic of the motivations 
in question, through the principle of afferent interaction and the 
resulting stimulus patterning, suffice to mediate the necessary dis- 
crimination. 

The hypothesis of the endocrine or chemical motivational mech- 
anism and the associated principle of the drive stimulus, when 
coupled with various other postulates of the present system such 
m primary reinforcement, primary stimulus generalization, and 
tiie of^x3sition of experimental extinction to excitatory potential, 
mem to able to mediate the deduction, and so the explanation, 
of nearly all the major known phenomena of primary motivation.^ 
In addition to the phenomena already summarized there may be 
mentioned the further deductions flowing from the system: that 
r^istanee to extinction maintains a consistent growth function of 
numl^r of reinfoicements for any constant drive; that the 

^ of to involve tbe action of fractional ante- 

imctkins of orientation. Space is not here available 

fm tte cf ntedbaniams and their action in motivational 



PRIMARY MOTIVATION AND RJEACTION TOTENTIAL 253 

mymptotes of these growth functions are themselves functions of 
the strength of drive; that for constant habit strengths^ reaction 
potential has a positive acceleration for increasing drives between 
lero and the drive employed in the original reinforcement; tfiat if 
habit strength is zerOj reaction tendency is zero; that an increase 
in drive will over-ride the total extinction of a reaction potential 
arising from a weaker drive: that in a given objective habit situ- 
ation the abrupt shift from one drive to another will, in the absence 
of discriminatory’ training, disrupt the behatior to some extent^ 
though not completely: that transfers of training Ihabitsi from one 
motivation to another will be prompt and extensive ; that organisms 
in the same externa! situations will learn to react differentially in 
such a tvay as to reduce different needs; that the conditioned evoca- 
tion of endocrine secretions facilitates the evocation of muscular 
activity on the subsequent presentation of appropriate conditioned 
stimuli, which is believed to be the role of “emotion” in the moti- 
vation of behavior. 

On the basis of the various background considerations elabo- 
rated in the preceding pages, we formulate our sixth and seventh 
primary molar laws of behavior; 

POSTULATE 6 

Associated with every drive (D) is a characteristic drive stimultis (Sn) 
whose intensity is an increaring monotonic function of the drive in 
question. 

POSTULATE 7 

Any effective habit strength (sHr) is sensitized into r^ctkii po- 
tentiality (sEr) by aH primary drives active within an organism at a given 
time, the magnitude of this potentiality being a product obtained by 
multiplying an increasing function of sHr by an increasing function of i>* 

From Postulate 5, 6, and 7 there may be derived the following 

corollary: 

MAJOR COROLLARY H 

Ihe amount of reaction potentiaHty (sEr) in any given p “ ^ 

tional situation is the product of (1) ffie effective haMt strength 4. Sj^r) 

under the existing conditions of pdmaiy drive multi|ii^ by (t) the 
quotient obtained from dividing ffie sum of the dominant value of the 
primary drive (D) jdus the ag gr egate strengffi of all the non-dominant 
primary drives (Z>) active at the time, by the ®mi of the same non- 
chuninant drives {dus the |^i3r^h^cal drive maximum (Mb). 
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NOTE^ 

Matlieniatical Statement of Postulate 6 

= bf(D), 

where 

6 > 0 . 

^lathematical Statement of Postulate 7 

sEb =f(sHB) XfiD) (33) 

Mathematical Statement of Major Corollary II 

sBr = & 4 . st^Sr (34) 

wtee 

D = &e strength of the dominant primary drive at a moment under 
^Bsi^^ration 

D = the aggr^Lte strength of all the non-dominant primary drives and 
quasi-drives at the moment under consideration 

Mb = the phymologic^ drive maximum (100 motes) 

fii = th^ iK>n-drive component of the ^mulxis complex at the moment 
mnfer cxiimderation 

Sb =* tl]^ sdmulus ^)ecifi<mlly dependent upon the primary drive at the 
nKjn^nt under eonaderation 

M^sj^s = tte phymdc^e^ aimmaiion of b^Sr and Si^Sr 

s^Hm = effective iudiit hiding of the non-drive component of the stimulus 

complex 

M^E = the effective habit loading of sj^Hr 

The Equations of Perm's Graphs 

TIm? curve drawn ^ upper set of data points of Figure 50 was plotted 

f rcsi the fitted equati<m : 

n = 65(1 - ^ - 4 , 

wi«e m ti^ number of uiireinforced reaction evocations to produce 

extincrion and N repr^nte the number of reinforcements in the 
spring up erf the habit. TT^ curve drawn throng the lower set of data points 
ctf Furore 50 wm from tl^ fitted equation: 

s = ^(1 - ^ - 4 , 

wte:e » aiai N have tiie rignific^nce as in tire preceding equation. Note 
^ ktetity of tim .0180 and .0185. 

TM &»wa ti» date pmnte oi Figure 49 was plotted from the 

a -4, 
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where n meaiis the same as ^x)ve and h represents the number of hours of food 

privatioa. 

The surfaee jmssiag amoag the data points of F'iguie 51 was by 

the fitted equation : 

n = 21.45 CIO — i)Cl - ‘^1 - 4 (35) 

in which and .V the same as before. A eompaii^n of thi.^ equation 
with the preceding equations show's that it is essentmJIy the pmitive growth 
function of the first two equations in which the asymptote lias l>eeii taken by 
a function of the drive (k) derived from the third equation. Tl:ius n, regarded 
as action potentiality, may be to be a multiplicative function of h, or nioti- 
varioa, and A’, or habit strength. 

The Equations Employed in the Derivation of Table 5 and Figure 56 

Hie pc^tive growth function from which the values in edumn 2 of Table 5 
were derived is: 

- 75(1 - 

The equation by which the values of column 3 were derived from thiBe In 
edumn2is: 

sETe = 100 - v' 10,000 - 100:.-, - 

This equation is a special form of that reprinting the ph 3 rBiolc^cal summaticm 
of two hsd>it tendenci^ given below. 

The equation from which the values of the drive (D) m*ere calculated from 
the number of hours’ food privation (h) is: 



The values of tl:^ drive deviations (dO were calculated from the equation: 

d' = ly -D, 

where IF reprints the strength of drive employed in the formation of the habit 
and D reprints the strength of drive uncier which stimulatian calculated to 
Imd to ruction evocation occurs. 

The equation by means of which the values of columns 4, 5, 6, 7, S of 
Tabk 5 were mloilat^ from tho^ of cdumn 3 is: 

- s^BMao-’^^dr 

TM% it may be noted, is equation 29 (p. 199 ff.), the equation of primary stimuli^ 
generalization in which reprints the efiective-habit-^trength loading of 
the drive stimulus. 

The equation by means of which the valu^ of columns 9, 10, 11, 12, and 13 of 
Table 5 were cdculated from those of c^umns 4, 5, 6, 7, and S k: 

H _ 5 ■ W ^ ^ . 

- S^tim i" 


Tl^ vbIu^ of ^^ve drive (D) were foumi by the equaricm: 


D 


= 100 


i>+ D 
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in whicli i) is mpp<md to be the sum of the generalized effects of all the irrelevant 
driv^ active at the time, and Mn represents the nmximum drive possible in a 
4^ntigrade system, i.e., 100. The value of t) found by trial to fit the Perin data 
fairly well is 10. Therefore the equation becomes: 


D = 100 


10 -fD 
110 


For example, in case D m maximal this equation becomes : 


D 


= 100 


110 

110 


= 100 . 

TIk Imsic equation by means of which the values of columns 14, 15, 16, 17, 
ami 18 of Tshle 5 were calculated from the values of columns 9, 10, 11, 12, and 13 
€rfTabte5,k: _ 

bEm = «!+ s^Ss X 

&b6tituting ti® equivalent of D and simplifying, this becomes: 

rr ^ V/ 10 4“ 

X — 
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CHAPTER XV 


Unadaptive Habits and Experimental Extinction 

Science tacitly assumes that similar causes will be followed by 
similar effects. In the field of behavior djmamics it is accordingly 
to be expected that an act which has been followed by a need 
reduction in a given situation will always be so reinforced when- 
ever the reaction occurs in other exactly similar situations. It may 
be noted in this connection first that, as a rule, most of the factors 
of such situations possess energies which activate one or more of 
the receptors of the reacting organism. Secondly, according to the 
law of reinforcement (p. 80 ff.) , all stimuli whose receptor dis- 
charges are contiguous with reactions -which are followed by rein- 
forcing states of affairs tend to acquire the capacity of later evok- 
ing that reaction. As a result of this combination of circumstances 
it comes about that on the recurrence of the situation in question 
(including the need) the corresponding stimuli must also recur, they 
will evoke the reaction conditioned to them, the need wiU be re- 
duced, and survival will be facilitated. 

At this point of the analysis, however, serious complications 
appear. In the first place, exact duplicates of situations probably 
never recur. A second complication is that by no means all of the 
factors of any reaction situation are critical in the sense that their 
prince is necessary for the act in question to bring about need 
reduction. A third and closely related complication lies in the fact 
that the really critical factor or factors of the reaction situation 
may not stimulate the receptors of the reacting organism at all; 
in the field of vision, for example, the view of the critical factor 
may cut off by the interposition of a completely irrelevant ob- 
jact. Since oiganisms have no inner monitor or entelechy to tell 
them Ml advance which stimulus elements or aggregates are asso- 
ciated with the critical causal factor or factors of reaction situa- 
tions, the law of reinforcement, other things equals will mediate 
lAc conrmctums of ike non-cntical stimidus elements to the reaction 
as m thme of the critical ones. 

Mm a r^ult of tiie Largely random flux of events in the world 
to whfch react, it inevitably comes alK)ut that they 

wifi of^ he Simulated by extensive groups of conditioned stim- 

m 
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ulus elements, none of which is causally related to the critical factor 
or factors in the reinforcement situation. In such cases, if the 
srimuli evoke the reaction it will nut be follow'ed by reinforcement. 
This, of course, is wasteful of energy' and therefore unadaptive. 
The necessarily uiiadaptive nature of an appreciable portion of 
the habits set up by virtue of the law of reinforcement naturally 
raises the question of how organisms are able to survive iinder such 
conditions (7^ p. ^Ij. The answer is found in the behavioral 
principle known as experimental extinction. This is the subject 
of the present chapter. 

CONCRETE EXAMPLES OF EXPERIMENTAL EXTINCTION 

The principle of experimental extinction is so ubiquitous a 
factor in behavior dynamics that it can hardly escape anyone's 
obser^'ation. The dog w'hich has been taught to '‘speak” for food 
soon ceases to do this if the food, petting, etc., is systematically 
withheld following the act. The principle has even passed into 
folklore, as showm by the fable of the boy who shouted “Wolf! 
Wolf!” when no wolf was near. After a few such false alarms the 
rescuing behavior of the hearers would inevitably become extin- 
guished, and they would cease to respond to the calls quite as the 
fable states. 

The systematic investigation of experimental extinction origi- 
nated in Pavlovas conditioned-reflex laboratory in Petrograd. Since 
the comparative simplicity of the conditioned-reflex technique 
brings out with maximum clarity the%^ntial principles involve, 
a description of one of Pavlov's experiments will as a useful 
introduction to the technical aspects of the subject, even though 
the artificiality of the experiment may tend somewhat to obscure 
the functional significance of the principle. Pavlov reports having 
produced a conditioned reflex by first showing a dog some meat 
powder and then letting him eat it. After a considerable number 
of reinforcements extending over several days, the mere visual stim- 
ulation of the fcKKi would evoke a profuse flow of saliva. Tbe meat 
fKiwder was then presented at a distance for a number of ^-second 
p^riewis, but without being follow^ by tihe customary feeding. On 
each of the latter (Mjcasions the number of cubic centimeters of 
saliva ^ret^ r^onled. The results of this piwedure are 
shown in Table 6, where it may be ^n (1) that after only a few 
ncm-remforced reactioi^ the visual stimulus completely lo^s ite 
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TABLE 6 

Table Summarizing the Results of an Experiment Involving Experi- 
mental Extinction, Performed in Pavlovas Laboratory (5, p. 58). 


Succ^ive 
Unreinforced | 

Stimulations 

Number of Cubic Centimeters 
of Saliva Secreted in 

Each 30-Second Period 

1 

1.0 

2 

.6 

3 

.3 

4 

.1 

5 

.0 

6 

.0 


power of reaction evocation, and (2) that the course of this loss 
is progres^sive, the rate of fail being more rapid at first than later. 


EXPERIMEXTAL EXTIJ^’CTIOJ^ AS A FUKCTION OP THE ISTUMBER 
OF UIOIEIK-FOECED REACTIONS 

The process of experimental extinction has now been studied 
in many other laboratories where many different reactions have 
been extinguished under many different conditions. One of the 
more easily interpreted of these studies has been reported by Hov- 
land This investigator conditioned a simple sinusoidal sound 
wave of 10(K) cycles per second to the galvanic skin reaction in 
^ human subjects; the 24 reinforcements (a weak electric shock 
to the wrist) by which the habit was originally set up were sepa- 
rated by ^Lminute rest pauses after the first and second series of 
The habit was then extinguished by repeated evocations of 
the act without reinforcement. The pooled results of the first five 
reactions of this extinction process presumably yield a set of valu^ 
elo^ly approximating the typical rate of extinction. They are 
lep'^nted graphically by the circles of Figure 57. In order to 
determine more preci^Iy the characteristics of this negative leam- 
mg eime, a simple negative growth function was fitted to the 
values represented by the circles. From this the smooth curve 
p^^ing among the circles w-as plotted. A glance at this curve shows 
tt&l, at the lero jxiint^ the fit is excellent. Hie failure of 

amplitude of the reaction at point 0 to be as high, relatively, 
t^ oih^ amplitude may plausibly be interpreted as due to 
iaMbitiim of reinfcrconent^^ (^e p. which is subsequently 
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^^disinhibited" in parij at least, by the abrupt change in experi- 
mental routine incidental to the process of extinction |5 k We 
conclude, then, that the cur\’e of experimental extinction when un- 
complicated by irrelevant factors is probably a simple negative 
growth function. 

Further examination of the smooth cur\’e in Figure 57 reveals 
two additional characteristics w’hich merit consideration. The first 



NUMBER OF PRECEDING EXTINCTION TRIALS 


Fig, 57. A graphic representation of the course of habit deerenicnt as a 
function of succe^ve unreinforced evocations of the previously learned act, 
(The data were received from Hovland in a private eommunication.) 

is that the asymptote (limit of fall) of the curve is not zero, but 
about 24 per cent of the amplitude of what the reaction was before 
extinction began. This is probably an artifact due to the well- 
known tendency of the skin to yield appreciable gah^anic reactions 
even to mild stimulations previous to any specific conditioning 
whatever (see p. 186 ff.). In this resj^ct it is believed that the 
Favlovian results shown in Table 6 more truly represent the quan- 
titative principle of experimental extinction. The second notable 
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characteristic of the function shown in Figure 57 is its strikingly 
rapid rate of decrement, its F- value being a little less than By 
way of contrast we may consider the rate of acquisition of a com- 
parable habit, which is shown above in Figure 21 (p. 103). The 
fractional increment iF) of this latter growth function is approxi- 
mately 1/14, which corresponds to a radically slow^er rate of change. 
Thus we arrive at the indication that the rate of decrement under 
the present set of conditions is more rapid than is the rate of the 
original acquisition of the habit. 

In this connection it must be pointed out that of those reactions 
extensively investigated, the galvanic skin reaction and the salivary 
reaction both show a progressive decrement in amplitude as experi- 
mental extinction progressively reduces the excitatory tendency to 
lero. The typical motor reaction differs sharply from this by dis- 
playing under comparable conditions mainly an increase in latency 
and a decrease in probability of occurrence. Motor reactions under 
certain conditions, at least, are apt to show an increase in reaction 
intensity in the early stages of extinction, though in the later stages 
there is usually a slight tendency to a diminution in the intensity 
of the reaction {£, p. 148). Because of these differences it is prob- 
able that the salivary reaction is the one best adapted for the 
quantitative determination of the characteristics of the curves of 
both simple learning and simple extinction. 

THE STIMULUS GE:OTRALIZATIO]Sr OF EXTINanON EFFECTS 

In an earlier chapter (p. 183 ff.) we saw that habits manifest 
the phenomenon of stimulus generalization. Since physiologically 
maladaptive habits (when first formed) are no different than other 
^eitatory tendencies, it is to be expected that they likewise will 
generalize. We have also seen that experimental extinction pur- 
paiiB to a Mud of corrective mechanism. It is evident, however, 
that if ex|^rimental extinction is to correct false habit tendencies 
with reasonable efficiency, extinction effects must also generalize; 
if extinction m-ere a mere phenomenon, false receptor-effector 
emmmumm would probably never become wholly eradicated be- 
the of extinguishing a zone consisting of infinitesimal 

literally mterminafale. 

As a matter of fact, extinction effte:s do manifest stimulus 
to a marked d^ree. TMs important phenomenon, 
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like so many others in this fields also appears irst to have been 
investigated in Pat'lov’s laborator\\ For exampie, the sound of 
a buzzer, the sound of a metronome, and a tactile stimulation were 
separately conditioned in a dog to the saiivaiy^ reaction induced 
by having w’eak acid injected into bis mouth. At the conclusion 
of the positive training the buzzer evoked 13 drops per SO-secood 
period, the metronome, 12 drops, and the tactile stimulus, 4 drops. 
The metronome was then extinguished by unreinforced presenta- 
tions at 3-mmute intervals until it evoked no secretion whatever. 
A few minute later the secretion evoked by the tactile stimulus 
had fallen to zero and that by the buzzer had fallen to 2.5 drops 
\ 9 , p. 55i. The effects of the extinction of the metronome habit 
had clearly generalized in such a w'ay as to inhibit completely the 
tactile habit and almost to inhibit the buzzer habit. 

Pavlov even investigated the quantitative gradient of general- 
ized extinction effects. He set up a number of ^‘homogeneous*’' 
conditioned reflexes to a series of points on the skin, then extin- 
guished the reaction whose stimulus was located at one extreme of 
the series and noted the extent to which the other conditioned 
reflexes were weakened as a result. On the basis of such experi- 
ments, Pavlov concludes (P, p. 158) : 

It is plain that, the further away on the skin the ^condarily inhibited 
place is from the place which undeigoes the primary inhibition [extinc- 
tion], the weaker is the irradiated inhibitor>' after-eff^t. 

Sul^equently, Anrep (f), Bass and Hull (5), and Hovland ( 8 ) 
sought to plot the generalization gradi«xt of extinction effects with 
progressively more refined experimental pixMjedmm Anrep em- 
ployed cutaneous stimulation with the salivary reaction to food 
in do^; Bass and Hull employed cutaneous stimulation with the 
galvanic skin reaction evoked by an electric shock in humans. 

Hovland conditioned the pitch of four pure ton^ an equal 
amount to the galvanic skin reaction evok^ originally by a vreak 
electric shock. The ton^ were so chosen that they differs! fiom 
each other by 25 di^rimination thresholds (jja.(L*s). Ihen a tone 
at one or the other edreme of the series w^ extinguished to a 
partial but known degr^, aft©* which all four ton^ ware tc^ed to 
deteimine the strength of the i^duaJ excitatory tendency evwable 
by each. The pooled results of the twenty subjects employed in 
this inv^tigation are repi^entai graphically by the circle in 
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Figure 58. This series of circles constitutes a clear verification of 
the gradient of generalized extinction effects reported by Pavlov, 
It is evident that experimental extinction is an extended and not 
a point phenomenon. 

The precision and general reliability of Hovland’s data also 
warranted an attempt at a determination of the mathematical 



Fiq. 5S. Empiri(^l stimulus generalization gradient of an extinguished 
g^lTanlc i^n reaction plotted from data published by Hovland (e). The 
gradient ex^nds in both directions on the stimulus continuum (rate of 
v&atioB) from the point extingui^ed (0); d is the difference between the 
dHigmalj conditioned and the stimulus evoking the reaction. Note 
that here the ipadbents axe directly the reverse of those shown in the cicely 
rek^ F^ure 42, p. 185. 


charactaistics of the gradient. To this end, an equation was fitted 
to the values represented by the circles. From this equation was 
pMm the smmih. curve running among them. That the equation, 
a smple iK^itive ^owth function, fits the data rather well may 
aesi an of file figure. We accordingly conclude 

that ^ gr^mt of stimulus generalization extinction effects is 
prolmWy m dmpie p^five growth function. 
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THE IXTERACnOX OF THE GEADIEXIS OP EXCTTATIOX AXB OF 

EXTIXCriOX 

It 25 clear from a comparison of Hovland s excitation and ex- 
tinction gradients (Figures 58 and 42, p. 264 and that if thcdr 
parameters turn out to be alike the second is exactly the shape 
which would be required completely to eliminate the first. The 
conditions of the two experiments are such that an exact agree- 
ment is not to be expected between the two maximum opposing 
values. In the matter of the constant incremental factor of change 


m 



Fig. 59. Diagram repr^entiag the manner in which the gradients of ex- 
perimental extinction are supposed, theoretically, to interact with the gradients 
of a false or nnadaptive reaction tendency in such a way as to elimiimte the 
latter. The excitatory gradients are represented by the upper curres, thc^ 
of extinction effects are reprinted by the lower curves, and the reside cff 
effective excitatory tendency is repre^nted by the dotted line in between. 

(F), which might easily be the same, we find that agreement also 
does not exist; the F-factor of the excitation generalization is ap- 
proximately 1/33, whereas that of extinction is approximately 1/21, 
the range of the former being appreciably the greater. A difference 
of this order might, however, e^ily arise from smnpling ^^eirom.” 
A considerable amount of careful quantitative res^ioh is badly 
needed to clear up this matter. 

The manner in which the two gradimts are conceived to inter- 
act in the elimination of the unadaptive eff^ts of false (and there- 
fore nnadaptive) reaction tendenci^ is shown in Figure 59. In 
this figure the upper jmir of gradients represent excitation. They 
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are drawn on the assumption that reinforcement occurred at point 
zero on the stimulus continuum and diminished at the rate of 
approximately 1/29 at each additional j.n.d. of de\dation of the 
evoking stimulus. This value was chosen because it falls about 
midway between the F- values of Hovland's t’wo gradients. Extinc- 
tion is assumed also to have taken place at zero on the stimulus 
continuum, and to have continued imtil no reaction would be evoked 
by that stimulus, i.e., until the reaction tendency had passed be- 
neath the reaction threshold. Since the reaction threshold is taken 
at 10 units, this means that the extinction effects must have become 
great enough to neutralize 90 — 10, or 80 points of excitation; i.e., 
the extinction effects must possess a negative strength of 80 units. 
The generalization gradients of these extinction effects are plotted 
on the basis of the same F-constant as are those of the excitation 
effects, \dz., 1/29. By subtracting the extinction gradient from 
the corresponding excitation gradient, we obtain the effective or 
residual strength of the excitatory tendency, which is represented 
by the dotted line. This shows that at all points on the stimulus 
continuum the effective excitatory tendency is below the reaction 
threshold, i.e., at no point on the stimulus continuum will the 
unadaptive reaction be evoked. 

A rather different picture, and one of great theoretical signifi- 
cance, emerge when extinction takes place, not at the point of 
reinforcement but out on one of the wings of the excitation gradi- 
ent {8, p. 25). In nature this typically occurs when generalization 
has extended on the stimulus continuum to a point where the reac- 
tion will no longer be reinforced. Such a situation calls for dis- 
crimimtim on the part of the organism; thus primitive stimulus 
generalization is in a sense indiscriminate and is the natural anti- 
tJb^is of discrimination. For example, in Figure 59 the excitatory 
hsidency before extinction would evoke reaction with varying de~ 
of intensity or probability anywhere on the stimulus con- 
tinuum within atout 64 j.n.d.’s of the point originally reinforced. 

Suppose, however, that the stimulus at 8 j.n.d.^s on one side 
of the point of reinforcement represents a state of affairs which 
in conjunction with the habituated act will not yield reinforcement. 
SupfKBe further that this stimulus is presented to the subject re- 
imiE it will no longer evoke the reaction to any degree 
Tte interaction of the generalization gradients of the 
effete so produced, with thcBe of the original excitation 
m Awn in Flpme There, as in Figure 59, it may 
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be seen by the course of the dotted line that the point extinguished 
diminishes the effective exeitatoiy tendency to the reaction thresh- 
old, a reduction of some 58 points. It will also be noticed tlsat at 
no place beyond this point does the effective excitatory tendency 
rise above the reaction tlireshold. Hc-.vt-.vr, between the point 
extinguished and the point of original reinforcement the curve of 
effective excitation rises steeply to a value of 45, which is far above 
the reaction threshold. This means that through the interaction 



Fig. 60, Diagram representing the manner in whieh the gradients of ex- 
perimental extinction are suppcsed, theoretically, to interact with the gradients 
of a pc^tive ruction tendency to generate the phenomenon of di^rrimination 
learning. As in F^re 59, the upper curve reprints excitation, the lower 
curves represent extinction effects, and the dotted line in between reprints 
the redduai ^active reaction tendency. Note the greatly steepened gradient 
on the latter curve between 0 and S jjaul.^s. It is largely becaw^ of this that 
the improvement in simple discrimination is believed to <K«Mir. 


of the gradients of excitation and extinction the phenomenon of 
simple discrimination learning has been generated as a secondary 
prindpleJ- 


THE BEACTION GENEBAldZATION OF EXUNCHON EFfECIB 

Just as the adaptive aspects of behavior dynamics require a 
stimulus generalisation of extinction effects to neutmliie eff^tivelj 

^Tliis Tww of timple disOTnination leOTting follows substantially the ap- 
pimch developed by Spence in numemi^ theoretical and experimental Mndi^ 

ao, il, if, if, 14 ). 
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the unadaptive habits inevitably set up in considerable numbers 
by the indiscriminate action of the law of reinforcement, so the 
reaction generalization tendencies of unadaptive habits require for 
the survival of organisms that there be a complementary reaction 
generalization tendency of extinction effects. Here again Pavlov 
made the initial discoveiy^ He reports {9, p. 54) : 

This latter phenomenon [generalized extinction effects] involves not 
only those conditioned reflexes which were based upon a common uncon- 
ditioned reflex with the primarily extinguished one (homogeneous con- 
ditwned reflexes), but also those which were based upon a different un- 
conditioned reflex (heterogeneous conditioned reflexes). 

In Pavlovas terminology, if a tone and a tactile stimulus were each 
separately conditioned to an alimentary salivary secretion by the 
use of food reinforcement, the resulting reflexes would be homo- 

geneous. If, on the other 
hand, the tone were rein- 
forced with food and the 
tactile stimulus were rein- 
forced by weak acid being 
injected into the mouth, the 
tactile conditioned reaction 
would be defensive and ob- 
servably different in nature,* 
the latter two conditioned 
reactions would accordingly 
be called heterogeneous. 

An analogous phenome- 
non in a selective learning 
situation was reported by 
Youtz (15 ) ; a closely related 
study worked out with me- 
ticulous care has been per- 
formed by Ellson (4)- Li 
the latter investigation al- 
bino rats were placed, one at 
a time, in a small, sound- 
rfildded, cubical space. From one wall of the chamber there pro- 
pctei ^0 b^, as diown in Figure 61. These bars could be in- 
into tiie chamber or retracted at will, without opening 
Ehirmg the learning and extinction proce^^ 



Fla. 61. Sketch dsowmg the two 

employed m Elison’s experi- 
Uf p. 341). 
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only one bar was presented at a time. A lump of moist loixi was 
placed behind the panel so that its odor would be carried through 
the bar slot by the renlilating system. By investigating this odor 
the animal sooner or later w'ould move the bar in question^ the 
horizontal bar downward, and the vertical bar to the left. Either 
movement caused a magnetic release mechanism behind the panel 
to give a click and at the same time a cylinder of food dropped 
into a cup beneath. In this w'ay the horizontal-bar habit was set 
up ; the vertical-bar habit w’as set up 30 minutes later in an exactly 
analogous manner. 

On the next day the tw'o habits were extinguished in succession, 
first the vertical-bar habit, and 5.5 minutes later the horizontal- 
bar habit; this vras done by severing the electrical connection be- 
tween the manipulative bars and the food-release mechanism. The 
horizontal-bar habit was extinguished in control animals without 
a preceding extinction on the vertical-bar habit. It was found that 
on the average the control animals operate the horizontal bar 
49.7 times before a given degree of extinction supervened, whereas 
5.5 minutes after the extinction of the vertical-bar habit the experi- 
mental animals extinguished on the horizontal bar to the same 
degree after only 23.9 operations of the bar. This reduction in 
resistance to experimental extinction of approximately 50 per cent 
clearly su^ests reaction generalization of extinction effects. 

It is to be noted, incidentally, that neither in the Pavlovian 
experiments nor in that of Ellson is the reaction generalization 
uncomplicated by stimulus generalization. In both cases there was 
a decided element of stimulus similarity between the original ex- 
tinction and the subsequent generaliiation-of-extinetion situation 
in that the incidental stimuli from the apparatus environment were 
identical. Neverthele^, these experiments demonstrate that extinc- 
tion effects are transferable from one reaction to another reaction 
which is to a considerable extent different. 

THE SPONTANEOUS RECX)VERY OP EXTENCTION EFFECTS 

The account of experimental extinction contained in the preced- 
ing sections of the present chapter, together with the somewhat 
misleading use of the word eoctincticn in this connection, might 
easily su^^t to the uninitiated that experimental extinction neces- 
sarily abolishes completely and permanently the unadaptive reac- 
tion tendency involved. This is far from being the case, as Pavlov 
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himself long ago pointed out (4, p. 60). That a reaction tendency 
may be very much alive after total experimental extinction is 
demonstrated in a striking manner by the fact that if the condi- 
tioned stimulus is withheld from the organism for some time after 
experimental extinction has occurred, its reapplication will evoke 
the reaction to a considerably greater extent than it did at the 
conclusion of the original extinction. This is known as spontaneous 
tecove-ry. 

The phenomenon of spontaneous recovery was discovered by 
Pavlov; there is accordingly a certain appropriateness in choosing 
our initial illustration of it from his writings. This example comes 
as a sequel to an extinction experiment reported above (p. 259 ff.) 
and summarized in Table 6. Following the extinction there de- 
scribed, the dog was left to itself for two hours, after which the 
conditioned stimulus (visual presentation of meat powder) was 
again delivered. This was followed by .15 cubic centimeters of 
salivary secretion, which showed that the reaction tendency had 
spontaneously recovered about one-sixth of its original strength. 
Usually the rate of recovery is greater than this. 

Proceeding now to the consideration of the quantitative law 
of the s|K>ntaneous recovery of experimental extinction as a func- 
tion of time, we turn to another portion of the Ellson experiment 
described in the preceding section of the present chapter (p. 268 ff.). 
Four groups of 2b albino rats were trained by means of an equal 
number of food reinforcements to depress a horizontal bar (Fig- 
ure 61) imtU the habit was thoroughly learned. They were then 
printed with the bar and permitted to operate it without rein- 
foreement until a period of five minutes elapsed without recorded 
pi^ur^ FoUowing this one group was extinguished to the same 
a seamd time after a recovery period of 5.5 minutes, another 
graip was spun extinguished after 25 minutes, a third group after 
65 minute, and a fourth group after 185 minutes. The solid circles 
in Figure 62 show the median number of unreinforced reactions 
mjtiired again to produce extinction after the several recovery 
l^ricKis. II is evident at a glance that spontaneous recovery is 
comiderable in amount, and that it is an increasing fimction of 
the diiratioa of the recovery j^riod. 

la m to dd-ermine this relationship more precisely, a 

|M«live pwih function was fitted to th^ values; this is repre- 
Mste hf iim curve drawn among the ^lid circte. While 

^ Mmhy mo w^am perf^ it is evident that the spontaneous 
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recovery from primary experimental extinction effects is approxi- 
mately a simple positive growth function. An inspection of this 
curve shows that at 185 minutes it practically reaches its asymp- 
tote, or limit of rise. It is also to be noted that this maximum is 
only about SO per cent of the original strength of the habit, which 
is indicated by the broken line at the top of the figure. 

We saw in the last section (p. 267ff.i that extinction effects 
manifest reaction generalization, which raises the question of the 



Fi<s. Graphic representation of Ellen’s empiiif^l valtt^ for 
^Kmtaneons recovery of a habit from primary experimental extinetion (lower 
curve, ^lid circles) and from r^ction-generali^d experimental extinctkm 
efects (upper curve, hollow circles). Both cur\'^ reprint i^ple pcmtive 
growth functions fitted to the circles through which th^ (Plotl^ from 
data published by Mtoa, 4 .) 

quantitative law regarding the spontan^us recovery of the reaction 
generalization of extinction effects. This problem al^ was inves- 
tigated by Ellson as a portion of the experiment just 
Four additional grouf^ of animals were extinguished on the ver- 
tical-bar habitj and then after recovery pericMis of (K, and 

185 minute they were extinguish^ on the horizontal-bar habit. 
TTie m^an numte* of tmreinforced reactions require to prcKioee 
experimental extineticHi are shown by the hollow circle of Fig- 
ure 62. 

A simple growth function was to these values; this is 
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represented by the smooth, broken-line curve drawn among the 
circles. Here again the fit is not very close, though there is an 
indication that spontaneous recovery from reaction-generalized ex- 
tinction effects approximates a simple growth function. It is to be 
noted that this curve also approximately reaches its limit of rise 
at 185 minutes, which, incidentally, almost exactly equals the 
number of unreinforced reactions required to extinguish the hori- 
zontal bar when this extinction is not preceded by the extinction of 
the vertical-bar habit. Moreover, it is probably significant that 
the rate of rise of this emve is very close to that of the one fitted 
to the data derived from the spontaneous recovery of primary ex- 
tinction effects; the fractional rate of change (F) of the latter is 
1/21, and that of the former is approximately 1/24. 

THE niSHTHIBITION' OF EXTINCTION EFFECTS 

A second phenomenon which demonstrates that experimental 
extinction to the point of zero reaction does not necessarily abolish 
a reaction tendency permanently and completely is that known as 
disinhibition. This was pointed out by Pavlov, in whose labora- 
tory the phenomenon was originally discovered. The nature of 
disinhibition is nicely illustrated by the following account. 

Dr. Zavadsky, one of Paxdov's pupils, experimented with a dog 
which had two salivary fistulas, one from the submaxillary and 
the other from the parotid gland. Through repeated presentations 
and ing^tions, the sight and odor of meat powder presented at 
a distance had become conditioned to evoke the salivary reaction 
of both glands. Thereupon occurred the events summarized in 
Table 7, wrhere it may be seen that the first three stimulations 
were not reinforced. An extremely rapid experimental extinction 
resultiwi, shown by the fact that at the third stimulation the reae- 
fcima was %em. On the fourth trial, however, the presentation of 
the meat powder accompanied by an “extra” stimulus in the 
form of a cutaneous vibration. In this case three drops of saliva 
wem ^reted, which indicates that the inhibition was partly abol- 
ish^ (disinhibited). Five minutes later when the meat powder 
wm pmmntM it was accompanied by knocks xmder the table; dis- 
mMbition was again manifested by a secretion of two drops. After 
five miaiite tie meat |K)wder was presented alone ; the zero reac- 
tiOT to this indicates that the extinctive inhibition had returned, 
following the preceding disinhibition. This observation illustrate 
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T.ABLE 7 

SrMMARY OF Dk. ZaVADSKY’? EXPERIMENTAL ResCLT;-^ IlLCSTEATING BoTH 
Simultaneous and PER.>5E\T;R.iTivE Aspects of iiihiNHiBixioN (y, p. §5j.* 


■ Amount of Saliva in Drops 
During One Minute 

JcSrence Stimulus Applied During One Minute 

From Submaxil- F rom Paro- 
lary Gland | tid Gland 


1:53 P.M. Meat powder presented at a distance 11 

1 :58 P.M. Meat powder printed at a distance 4 

2:3 P.M. Meat powder presented at a distance 0 

2:8 P.M. Same 4- tactile stimulation of skin ... 3 

2:13 P.M. Same 4- knocks under the table 2 

2 : 1 8 p.SdL Meat ^wder at a distance 0 

2 P.M. Prof. Pavlov enters the room contain- 
ing the dog, talks, and stays for two 

minutes 

2:23 P.M. M^t powder at a distance 2 

2:2Sp.m. Same 0 


* Previoiis to this experimect it had been ^own repeatedly &at iseitiser the tatetHe nm &e 
auditory stimulus, nor the entry of Professor Pavkiv into the expmmcntai room, prodooed any 
secretory effect at alL 


the transitory nature of disinhibition, which presumably is due to 
the fading out of the stimulus trace of the extra stimulus. It is to 
be noted (Table 7) that no disinhibition was follow'ed by a reaction 
nearly as great as the eleven drops evoked at 1:53 f.m. preceding 
the experimental extinction. 

The reactions just considered are cases of stmidtaneouB disin- 
hibition. In spite of its clearly transitory nature, disinhibition 
shows a certain tendency to perseveration, or after-effect. The 
entrance of Professor Pavlov into the experimental room for two 
minutes served as an obvious external stimulation. One minute 
after he had left the meat powder wm presented at a distance and 
it evoked the conditioned reaction of five drops ; this illustrates the 
perseverative effect of disinhibition. Five minutes later, however, 
the same stimulus produced no reaction, w'hich shows that the jw- 
^veration was distinctly brief in duration. 


SUMMARY 

The conditions under which reinfoi^ements CKJCur inevitably ^t 
up many receptor connections that are false in the sense that if 
the stimulus elements in question should alone evoke the reaction, 
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reinforcement would not follow. The fimctional corrective of this 
unadaptive aspect of the law of reinforcement is experimental ex- 
tinction. This consists of a progressive weakening of the reaction 
tendency whenever the evocation of the reaction is not followed 
by adequate reinforcement. Experiments suggest that the reaction 
tendency diminishes as a negative growth function of the number 
of closely successive unreinforced evocations. In one study in 
which analysis was made of the relevant learning curves, it was. 
found that the rate of loss through extinction was much faster than 
the rat-e of the original acquisition of the reaction tendency. 

Just as positive habits manifest stimulus generalization, so do 
extinction effects also manifest this tendency. Moreover, the gradi- 
ent of the generalization of extinction effects is of such a nature 
that if the incremental factor (F) in the two cases were the same 
the extinction of a habit at the point of reinforcement on the 
stimulus continuum would completely neutralize the reaction ten- 
dency, not only at that point but at all other points on the stim- 
ulus continuum to which it would show primary stimulus general- 
ization. 

In case experimental extinction occurs on one wing of a posi- 
tive generalization gradient, the interaction of this with the result- 
ing extinction generalization gradient produces a greatly steepened 
gradient of that portion of the effective reaction tendency lying 
between the point of extinction and the point of the original rein- 
forcement. This steepened gradient leaves the latter stimulus still 
able to evoke the reaction, while the former does not. The result 
is a distinctly heightened power of discrimination. 

In both conditioned-reflex and selective-learning situations, ex- 
tinction eff^ts manifest reaction generalization, quite as do posi- 
tive h^it tendencies. 

ExpOTmental extinction does not necessarily abolish completely 
Mid f^rmanoatly the reaction tendency extinguished; this is shown 
by the phenomenon of spontaneous recovery. Spontaneous recov- 
ery, of both primary extinction and response-generalization extinc- 
tion effects, takes place approximately as a positive growth function 
of time elapsing since the termination of the extinction proc^. 
UiMlar the continuous action of a static conditioned stimulus, spon- 
of both primary mid generalized extinction nearly 
ite matimum at three houm. Maximal ^ontaneous recov- 
^ primly eS^ts under the^ conditions is about 
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50 per cent, whereas that of generalized extinction efiFects is ap- 
proximately 100 per cent. 

A second phenomenon, known as disinhibition, further demon- 
strates that experimental extinction does not necessarily constitute 
an abolition of the extinguished reaction tendency. This means 
that a weak “e.xtra” stimulus will partially restore an extinguished 
reaction tendency. Such restorations are quite transitory, however. 
Disinhibition may partially restore an extinguished reaction ten- 
dency even when the extra stimulus has been delivered several min- 
utes before the attempt is made to evoke the extinguished reaction. 

NOTES 

The Equation Fitted to Hovland’s Curve of Extinction Data 
Following Distributed Practice 

The equation from which was plottai the smooth curve of Figure 57 m: 

= 24-1 + 80.9 X ^ 

where -4 is the ampKtude of the galvanic skin reaction evok^ by the conditioned 
stimulus, and N is the number of the prec^iing extinction repetitions. In thk 
equation the exponential value of .315 corresponds to a factor of reduction (F) 
of 1/1.94. 


The Equation Fitted to Hovland^s Generalized Extinction Data 
The equation from which the smooth curve of Figure 58 was plotted is: 

.4 = 6.7 + 3.25(1 - 10--«^0, 

where A is the amplitude of the galvanic skin reaction evoked by a stimulus, 
and d is the difference in j.n,d.k between that stimulus and the oim to which 
the reaction was originally extinguidied- 

The a>mi^uuble equation fitted to the sanm author^s em^meal data 

on esxUaimTf gaaa:alization effects k: 

A = 12.6-h6XlO--®issrf 

The F-valti^ corresponding to .0135 fe approximately 1/33; that eoir^poGdii^ 
to .01^ is approximat^y 1/21. Unfortunatdy it is not known whether either 
of th^ F-valu^ is cor^tant under all mnditions, and if not, upon what thdbr 
magnitude may depend. 

The Equations of the Curv^ ^K>wn m F%uie 02 

Tte fixation of the curve of the ^xjnta^ons primMy extiiarficm 

effects dbiWn in tite lowrar portion of Figure 62 is: 

222 ( 1 - - 2 , 

wl^e n m tiie numb^ nnrdaifcHO^ r^uir^ to prodo^ the ^ond 

expernnmrtoi extinction, <tnd f " m tim time in nunutes of the recovery period- 
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i.e., from the tennination of the first experimental extinction to the beginmug 
of the second experimental extinction. 

The equation of the curve of sioontaneous recovery from the reaction generali- 
zation of experimental extinction is: 

n-43 - 32X 

where n has the same significance as in the preceding equation, and f'" is tbe 
time in minutes of the recovery period, i.e., from the termination of the extinction 
of the vertical-bar habit to the beginning of the extinction of the horizontal-bar 
habit. 

The rate of lise to its asymptote of the curve represented by the first equation 
(F) is approximatdy 1/24; that of the second is 1/21. 
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CHAPTER XVI 

Inhibition and Effective Reaction Potential 


In the last chapter we considered experimental extinction as 
the mechanism which protects organisms from the evil effects of 
the imadaptive habits inevitably set up by the law of reinforce- 
ment. In the present chapter we shall again take experimental 
extinction as our point of departure; here, however, it will be 
regarded as a secondary phenomenon which arises under certain 
special conditions from the logically more primitive principle of 
reactive inhibition. Our expository procedure will be to state in a 
semi-formal manner certain principles, more or less physiological 
or submolar in nature, according to which reactive inhibition is 
believed to originate, operate, and disintegrate, and to accompany 
them with illustrative evidence. These submolar principles, it is 
to be noted, are not properly a part of the present system, which 
is molar, but are intended as a kind of background. Following this 
there will be presented a series of corollaries flowing from these and 
other principles of the system.^ In this way an attempt will be 
made to show how the principles explain, and therefore integrate, 
an appreciable variety of relevant empirical phenomena. Finally 
two primary molar principles will emerge and will be formally 
stated as such at the end of the chapter. 


QUANTITATIVE CONCEPTS AND P R E LI MINARY STATEMENT OF 
PRIMARY MOLAR AND SUBMOLAR PRINCIPLES RELATED 
TO INHIBITOEY POTENTIAL (lj{) 

Although the physiology of response inhibition is far from clearly 
known, a great deal of knowledge of a submolar nature has been 
discovered during the last quart® century. Our account of this 
subject wiE therefore proceed at first with submolar principle as 

^For eipositOTy purposes the prehmin^y propositions and the corol- 
lariffl whidi flow from them are mc^ tes alteimted in the foUowing 
It will be noted that the prelimmaiy prop<Mtioiis are indicated by 
capital letters, whereas the corollai^ we mdieated, as in other chapters, 
by ]11 IXQ€Xb1s. 

OT 
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a background, the Mowrer-Miller hypothesis^ being taken as a 
point of departure. While this hypothesis has a number of com- 
ponents which appear in various parts of the present chapter, the 
main or critical proposition may be stated as our first preliminary 
or sufamolar principle. 

A, Whenever any reaction is evoked in an organism there h 
left a condition or state which acts as a primary negative motimr 
tion in that it has an innate capacity to produce a cessation of the 
activity which produced the state. 

We shall call this state or condition reactive inhibition. Frcnn 
a quantitative point of view, reactive inhibition will be repre- 
sented by the symbol 7^. Just as bEr symbolizes a certain quan- 
tity of reaction evocation potential, so Ir symbolizes a certain 
potentiality of inhibition, i.e., a certain quantity of inhibitory 
potential. The reaction decrement which we have attributed to 
reactive inhibition obviously bears a striking resemblance to the 
decrements which are ordinarily attributed to ^Tatigue.*^ It is 
important to note that 'Tatigue” is to be understood in the present 
context as denoting a decrement in action evocation potentiality, 
rather than an exhaustion of the energy available to the reacting 
oi^n ( 17 ). 

From the foregoing it is evident that inhibitory potential (7^) 
is an unobservable and so has the status of a logical construct with 
all the advantf^es and disadvantages characteristic of such scien- 
tific concepts. In this connection it will be recalled (p. 21 ff.) 
that the prime prerequisite for the proper use of imohservables in 
scientific theory is that they be anchored in a quantitatively unam- 
biguous manner (a) to observable antecedent conditions or events, 
and (b) to observable consequent conditions or events. 

Piweeding at once to the satisfaction of the first of thm 
r^uirm^ta ^ in m far as the present state of our ignorance per- 

^ A brief statemeit of Dr. Mowrer's version of the Momer-Miller hypoth- 

k in an article by him^If and Mi^ Jones (/5) ; Dr. Millra^a 

vefson m pi^^ted in his recently pi±>lished book (U, p. 40 ff.). We are 
mTOb iailebted to Dr. Mowrer not only for material appearing in the pres- 
ent ^tionj bnt for idc^ ottered Ihrooghout the entire chapter; however, 
te tb® jmriicnto fonnuhttion of the hypothesis here presented, in so far as 
it difer^ frcan the vifews of Eh", Mowrer and Dr. Miller, as well as for the 
of mmi of the ^arollaries derived in oim way or another 
ir«n it, the audKar tak^ entire r^ponsibility. 

requirement nc^ only of fee pr^nt nnc^b^rvable ■ but cf 
a ef ^idoyed m mrl&et riiapters will be taken no in Chap- 
ter TmL ^ 
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mitSj we arrive at our second preliminary or submolar proposi- 
tion: 

B, The net amount of functioning inhibitory jmtmtial remiitmg 
from a sequence of reaction evocations is a positively aeceleraied 
function of the amount of work ijri involved in the performance 
of the response in question. 

Stated still more speeiiicallTj it may be said that the weight 
of the evidence at present available indicates that 

_ cn 
B-W' 

w’fiere n is the number of reaction evocations involved, c and B 
are empirical constants, and 

w - ri, 

in which represents force and L represents distance or length 
of the movement, as in ordinary mechanics. 

It is evident that the mean net increment of inhibition per rein- 
forcement must be the net inhibition divided by the number of 
reaction evocations. From this consideration and the basic equa- 
tion for J, it follows that for a given organism the mean net incre- 
ment of inhibition must have a constant value for a given value 
of Wf i.e., it must be 


B - W 

It is to be noted in this connection, hovrever, that the inhibitory 
potential resulting from a series of motor response is not a simple 
matter of mechanics, that it does not dej^nd merely ujx?n the 
force (F') and distance (L) involved in the movement. This is 
precluded by the constant, c, in the relationship. For example, it 
is presumable that for a given amount of energy consumption such 
as is required for the repeated lifting of a heavy weight, the value 
of c would be larger for the weak muscular si-’stem involved in the 
Sexing of the little finger than for the relatively strong muscular 
system which flexes the arm at the elbow. 

The relationship of energy exj^diture or work (IF) to tibe 
accumulated inhibition Im arising from a ^uence of unreinfoi^^ 
reaction ev<Kjations, such as occur in esj^rimental extinction, is 
convincingly demonstrate in an inveigatiem reporte by Mowrer 
and Jon^ (15). Three comparable groups of albino rats vrere 
traine on a Skinna* ty|^ of apparatus to press a bar for fcN^d 
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pellets. The bar was so constructed that different weights could 
be attached to it requiring the animal to make any desired mini- 
ma! pressure before the food W’ould be delivered. The animals of 
all three groups were given equal preliminary reinforcement with 
bar weights of o grams, 42.5 grams, and 80 grams. Following this 
the three groups were extinguished during three periods of 20 min- 
utes each, with 24 hours intervening between each extinction period. 

One group had the bar 
weighted throughout the 
process exclusively with 5 
grams ; the second group had 
the bar weighted with 42.5 
grams; and the third group 
had the bar weighted with 
80 grams. The mean number 
of unreinforced reactions 
made by each of the respec- 
tive groups of animals is 
shown in Figure 63. There 
it may be seen that the num- 
ber of extinctive reactions 
performed under constant 
conditions of habit strength 
and motivation is approxi- 
mately an inverse linear 
function of the work in- 
volved in the act. Evidence 
reported by Crutchfield (;^), 
from a rather different ex- 
l^imental situation, also suggests an inverse linear relationship. 
This relationship may be expressed rather precisely by a trans- 
potion of the expre^on for I given above, i.e., 

^ . Ub - W) 

c 

Since we are committed to a centigrade scale in the present 
it follows that the maximum of inhibitory i>otential must 
arbitrarily have a value of 100. Accordingly we take the unit of 
iniibitoiy potaitial as that amount of Ir which will just neutralize 
mm umi of reaction jK)tential. This unit will be called the pav, 
a syllable from the name Pavlov. It is suggested that the term pav 



Fro, 63. Graph showing the relation- 
ship of the number of unreinforced reac- 
tions performed by albino rats in one hour 
to the amount of involved in each 
reaction. (Plotted from data published 
by Mowrer and Jon^, 15.) 
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be pronoiinced to rhyme with have. The pav is accordingly defined 
in terms of the wat, thus: 

1 wat + 1 pav = OA 

Having formallj^ specified the value of Ijj, we must now inquire 
what changes take place in it with the passage of time. It will 
be recalled that inhibitory potential is assumed on the submolar 
level to have its physical basis in a negative motivational con- 
dition or state. This quite probably depends on a substance resi- 
dent in the effector organs involved in the response. Now, it is to 
be expected that such a substance will gradually be removed by 
the blood stream passing through these organs. Moreover, the 
amount of this removal per imit time following the cessation of 
the action should be proportional to the amount of inhibitory sub- 
stance present at any given time. This, of course, is equivalent to 
saying that the dissipation of Ib will take place according to a 
simple decay or negative growth function (p. 199 ff.) of time. We 
thus arrive at our third preliminary proposition: 

C. Each amount of inhibitory potential (Ir) diminishes pro^ 
gressively with the passage of time according to a simple decay 
or negative growth function. 

THE CONDITIOIQ'ING OF INHIBITORY POTENTIAL, ITS STIMULUS 
GENERALIZATION, AND THE CUNCEPT OF EFFECTIVE 
REACTION POTENTIAL 

At this point in our analysis we need to emphasize a somewhat 
different aspect of the above principles from that employed in the 
derivation of the preceding two propositions. The new emphasis 
will be on that portion of the Mowrer-Miller hypothesis (prelimi- 
nary Proposition A) which states that the after-effect of reaction 
evocation is a primary negative motivational state or condition. 
This means that the after-effects of response evocation in the aggre- 
gate constitute a negative drive strongly akin to tissue injury or 
^^pain.’’ If this is the case, we should expect that the cessation of 
the ^^nocuous” stimulation in question or the reduction in the 

^It must be obsen^ed that the formal precision of the definition of the 
unit of inhibition is superficially deceptive in that it is programmatic rather 
than an accomplished fact. This statement holds for the imits of habit 
strength and all similar units employed in the present work. It is believed 
that the use of such units even in a merely formal and programmatic sense 
adds to the clarity of the exposition and will contribute ultimately to an 
empirically workable operational definition. 
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inhibitory substance, or both, would constitute a reinforcing state 
of affairs. The response process which would be most closely asso- 
ciated with such a reinforcing state of affairs would obviously be 
the cessation of the activity itself. In accordance with the 'law 
of reinforcement” (p. 80 ff.) this cessation of activity would be 
conditioned to any afferent stimulus impulses, or stimulus trac^, 
which chanced to be present at the time the need decrement oc- 
curred. Consequently there would arise the somewhat paradoxical 
phenomenon of a negative habit, i.e., a habit of not doing some- 
thing (i4 p. 40ff.). Thus we arrive at our first corollary: 

I. ^imvR dosely associated v>ith the acquisition and accumi^ 
lotion of inhibitory potential (1^) become conditioned to it in such 
a way that when such stimuli later 'precede or occur simultaneously 
with stimulus situations otherwise evoking positive reactions, these 
latter excitatory tendencies will be weakened. 

Fortunately the existence of such habits is well authenticated, 
having long ago been demonstrated experimentally by Pavlov in 
what he called conditioned inhibition. Pavlov reports {16, p. 77 ff.) 
that a tactile stimulus was conditioned in a dog to a defensive 
salivary secretion produced by an injection of weak acid into the 
animaFs mouth. In addition, an alimentary conditioned reflex was 
mi up to the ticking of a metronome by having the latter followed 
by feeding. Then the metronome and a neutral stimulus in the 
form of a whistle were repeatedly presented together without rein- 
farcemoat, which produced experimental extinction of the condi- 
tioned alimentary reaction. Now, according to the present hypoth- 
^is this extinction process should become conditioned to any 
neutral stimuli associated with it, notably the whistle; the remain- 
der of the experiment proved this to be the case. After presenting 
tile tactile stimulus a couple of times to demonstrate the strength 
of ilB powa* to evoke the salivary reaction, the experimenter pre- 
saitai it for tiie fiist time in conjunction with the whistle. The 
quantitative rmilts are shown in Table 8. There it may be seen 
tiiat at 3:16 the tactile stimulus evoked a secretion of 8 drops in 
cue minute, whereas at 3:25, when the tactile stimulation was 
aimbinM with the whistle, the secretion was less than one drop. 
At 3:^ the tactile stimulus, now applied alone, evoked 11 drops, 
wMch still furthar emphasise the inhibitory effects of the whistle 
m tte c<»nbmation pr^ented at 3:25. It had, of course, 

be® dffimnstratol that previous to association with the extinction 
^f tite m^tiOT^une-salivaiy ixmditioned reaction the whistle did not 
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TABLE 8 

This Tabi£ Pkesenis Expehimentai. Results Illusteating the Action of 
Conditioned Inhibiteon (Evoeed by a Whistle) in Largely Suppressing the 
Evocation of a Heterogeneous Conditioned Reaction. (From Pavlov, 16 , 
p. 77.) 


Time 

Stimulus Applied During 

1 ^linute 

Salivary Secretion in Droj^ 
During 1 Minute 

3:08 p.M. 

tactile 

3 

3:16 p.M. 

tactile 1 

8 

less than 1 drop 

11 

3:25 P.M. 

tactile + whistle i 

3*.30 p.M. 

tactile i 

i 


produce external inhibition (16, p. 77) of the tactile-evoked sali- 
vary reaction. 

In continuing the discussion of conditioned inhibition it must 
be pointed out that Corollary I has introduced a new dimension 
into the concept of inhibitory dynamics; this is to the effect that 
the influence of inhibition on behavior evocation may be controlled 
by a stimulus. Such a possibility requires the employment of a 
new sjunbol which will explicitly express this fact. We shall do 
this simply by adding to the symbol for ordinary inhibitory poten- 
tialy h, the letter S as an extra subscript, thus: sIr. 

But the moment the action of inhibitory potential (sIr) is cued 
to a stimulus, the su^estion arises from the analogy to rHr, and 
so to sEr, that inhibitory potential will also manifest stimulus gen- 
eralization; this brings us to our second corollary: 

II. Conditioned inhibitory potential (rIr) will manifest stim-- 
idvs generalization in a manner exactly analogous to that of reac-^ 
lion potentiality (sEr), as given in Postulate 5. 

The appearance of two forms of inhibition on the theoretical 
scene instantly raises the question as to how they combine, which, 
in turn, requires the introduction of the concept of total inhibitory 
potential; this will be represented by the symbol Ir. In this way 
we arrive at the statement of our fourth preliminary proposition: 

i>. Simple reactive inhibition (Ir) and conditioned inhibition 
(sIr) summate functionally to produce Ir as would corresponding 
amounts of habit strength (p. 223) . 

With the concept of total inhibitory potential available, it be- 
comes necessary to introduce explicitly the concept of effective reac- 
tion potential; this is represented by the symbol The intro- 
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duction of this concept brings us to the statement of our fifth pre- 
liminary proposition: 

E. The effective reaction 'potential (sEr), i,e,, that reactim 
potential which is actually available for the evocation of action 
(R), is the reaction potential {sEr) less the total inhibitory poten- 
tial (Jr). 

Pinally, with Proposition E available it becomes possible to 
derive the empirically known law of spontanecms recovery. This 
concept was employed by Pavlov (16, p. 58) to designate the well- 
known fact (see p. 271) that conditioned reactions which had suf- 
fered experimental extinction tended in the course of time spon- 
taneously to recover a considerable proportion of their ori^nal 
effective reaction potentiality. By Proposition C, the inhibitory 
potential (Ir) operating against any given response (R) disinte- 
grate according to a simple negative growth function. But since 
(Proposition D) is a summation of Ir and and since (Propo- 
sition E), 

sEr = sEr — jfi , 

it follows that rEr will increase as Ir decreases, which brings us 
to our third corollary: 

III, Other things equal, an effective reaction potential (rEr) 
wMck has been reduced by the accumulation of inhibitory potential 
(Ir) will recover spontaneously through the mere passage of time, 
the course of the recovery being a simple positive growth function 
of the time elapsing since the termination of the final response of 
the series which produced the inhibition in question. 

Ample verification of Corollary HI in the case of extinctive 
inhibition was seen above (p. 270 ff. and Figure 62) . 

THB of the stimulus generalization and 

OF THB SFONTANiX)US RECOVERY OF EXTINCTIVE INHIBITION 

Now let it assumed explicitly, as was done tacitly in the 
derivation of Corollary I, that whereas rIr involves the whole 
neural rw^eptor-effector mechanism of habit, Ir involves only (or 
mainly ) the eff^tor portion of this mechanism. Accordingly it is 
to be exf^tai that Ir would not manifest the generalization gradi- 
m% characteristic of habit, but would display its presence by a 
cai^nt miMMint of diminished effective reaction potential of the 
of the stimulus evoking the reaction. This con- 
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stant or non-diminishing amount of reactive inhibition at all points 
on the stimulus generalization gradient of inhibitory potential 
brings us to our fourth corollary: 

IV. When a habit has been set up by well-distributed reinforce- 
ments and extinguished by massed evocations^ the asymptote or 
limit of rise of effective reaction potential (sEb), due to the stim- 
tdus generalization of the conditioned inhihitionj will always be less 
than the strength of the effective reaction potential just previous to 
the extinction. 

A second implication arising from the differential characteristics 
of Ib and sIr hinges on the empirically established principle that 
in animal experimentation habits are relatively immime to for- 
getting, whether set up by the conditioned reaction technique or 
by selective learning (7). This means that (Proposition C) Ib 
will manifest the phenomenon of spontaneous dissipation as a 
function of time, whereas gin, being a habit, will not to any great 
extent. These considerations, coupled with Corollary III and the 
equation, 

= -ffi + sUb}^ 

bring us to the conclusion that spontaneous recovery of sI^b 
occur only in so far as the extinctive inhibition is comprised of 
Ib. Thus we arrive at our fifth corollary: 

V. In case a reaction potential iaER) has been set up by dis- 
tributed reinforcements and extinguished by massed evocations, 
spontaneous recovery of bEb will be incomplete. This we have 
seen to be true in the case of EUson’s investigation (lower curve, 
Figure 62) . 

But since stimulus generalization is based on b^r or the habit 
aspect of reactive inhibition, it is not to be expected that the 
purely stimulus generalization of extinctive inhibition would show 
spontaneous recovery; this brings us to our sixth corollary: 

VI. Where effective reaction potential {bIEb) falls below simple 
reaction potential (sEjj) by reason of purely stimulus generaliza-- 
tion, i.e., from the action of conditioned inhibitory potential (bIr), 
the effective reaction potential (bEb) will display no spontaneous 
recovery whatever. No evidence bearing directly on Corollary VI 
has l^n found. An experimental test of its soundness would 

^The sign + indicates physiological summation according to equations 32 
and 43. 
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accordingly be of special value in determining the validity of the 
numerous assumptions ultimately involved in its derivation. 

Ify liowe%’erj the difference between sEr and qEb is due to purely 
reactive inhibition^ i.e., to the action of Ir, then complete spon- 
taneous recovery should take place; which brings us to our seventh 
corollary: _ 

Where eifective reaction potential {sEr) falls below sim- 
pAe reaction potential {sEr) by reason of the purely response gen- 
errdizatim of extinctive inhibition^ spontaneous recovery will ocawr 
and it will be complete. 

The spontaneous recovery observed in one portion of Ellson^s 
exj^riment (^) , that represented by the upper curve of Figure 62 
{p. 271) , presumably took place under conditions which approached 
of response generalization. Accordingly it may, or may not, 
be significant for the validity of Corollary VII that spontaneous 
recovery was approximately perfect. 


THE SrCCESSHTJ EXTINCTION OF THE SAME REACTION 
POTENTIAL 

From Corollary I it follows that Ir and sIr are generated con- 
currently. In cases where experimental extinction is complete, i.e., 
where (Propositions D and £), 

sEr = sEr — (Jr 4 " sl^ ^ 0, 

it follows that 

sEr — -ffi + sIr 

This rais^ many intriguing questions as to the relative amounts 
of two supposed forms of inhibitory potential that are gen- 
caratei under various conditions. As yet little experimental evi- 
cteice ^neeming this matter exists, although Ellson’s results sug- 
that during the initial extinction the two forms of inhibition 
in roughly equal amounts. 

Prom the precoiing considerations it may be concluded that if 
an organism is subjected to massed extinctive evocations every five 
or mx hmm, ^y, there will be an appreciable amount of both Ir 
asd sIr generated in the fimfc extinction. Six hours later the Ir 
wfll lar^y have b^n dii^pated, but, since habits do not dism- 
wi^ tto of time, the sIr will remain, so there 

will be m api^mably diminished amount of reaction potentiality 
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available for a second extinction; this will again generate both Ir 
and But since there will be less sEjt to be extinguished, less 
of both Ib and sin will be generated than on the first occasion, so 
that fewer extinction evocations and less time will be required. 
Since there will be less and less Jj? generated at each new extinction, 
there will be less and less spontaneous recovery after each recovery 
period. Thus we arrive at our eighth corollary: 

VIII. In case a reaction tendency (sEr) is subjected to the 
same criterion of experimental extinction by massed evocations at 
uniform intervals, the amount of spontaneous recovery manifest at 
each successive extinction will progressively diminish until ulti-- 
mately there may be no spontaneous recovery whatever, the number 
of unreinforced evocations required to produce a given degree of 
experimental extinction on the successive occasions being approxi'- 
mately a negative growth function of the ordinal number of the 
extinction in question. 

The e\ddence bearing on the soundness of Corollary VIII ap- 
pears to be internally inconsistent. The necessities of adaptive 
dynamics demand that an organism shall not continue forever to 
waste its energy performing acts which yield no need reduction; 
this is in agreement with the general observation that organisms 
do in fact ultimately give up completely the performance of such 
acts. On the other hand, there appear to be situations, notably 
certain ones involving secondary reinforcement (p. 84 ff.), in 
which a considerable amount of spontaneous recovery continues to 
(xrcur very many times. A striking case in point has been reported 
by Fitts (4). This investigator extinguished rats on a Skinner bar- 
pressing habit repeatedly at intervals of a week or longer. He 
found that the first three extinctions followed approximately the 
course deduced above, but the fourth extinction showed a statis- 
tically reliable increase in recovery which persisted, though with 
a gradual diminution, for five further extinctions. It seems likely 
that this reversal in Fitts' curve is due to some complex secondary 
reinforcement mechanism involving the stimuli arising from frac- 
tional antedating goal reactions, though the mechanism if^lf has 
not yet been worked out in detail. 

THE PHENOMENON OF DISINHIBITION 

If it is true, as implied by Corollary I, that conditioned inhibi- 
tion (gis) is a negative habit, it is to be expected that the occur- 
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rence of an extra stimvlus (an unaccustomed stimulus element) 
in the stimulus compound (S) would, through afferent neural inter- 
action (p. 42 ff.), produce a diminution in the si it- This kind of 
causal mechanism is, of course, exactly that which presumably 
gives rise to what Pavlov calls external inhibition (see p. 217 ff.). 
T\Tien applied to inhibition itself, Pavlov calls the action of the 
extra stimulus disinhibition { 16 , p. 61 ff.). This brings us to our 
ninth corollary: 

IX. Whenever a stimvlus element not customarily present in 
a compound stimulus (S) conditioned to an inhibitory tendency 
(sIb) occurs in such a compound, the amount of inhibitory potential 
evokable by the new combination wUl be less than that normally 
evoked by the stimulus compound originally conditioned to the 
inhibition. 

Since any change in the inhibitory potential can become mani- 
fest only indirectly through positive action of some sort, it follows 
from the relationship, 

sEs = S^B ““ {Ir + slid} 

that if the extra stimulus produced the same amount of reduction 
in as in rIm} the two effects would exactly offset each other; 
i.e., no change whatever could occur ‘in and therefore no 
change in observable behavior could result. Nevertheless, disin- 
hibition is a well-authenticated empirical phenomenon. This seems 
to require the assumption of a special susceptibility of conditioned 
inhibition to being upset by extra stimuli. In this connection Pav- 
lov remarks that 

, . . the inhibitory proe^ is more labile and more easily affected than 
the excitatory proce^, being influenced by stimuli of much weaker physio- 
Ic^cal strs^th. ( 16 , p. 99.) 

He cit^ expOTmentel evidence which purports to substantiate this 
view; i.e., which shows that a weak stimulus will weaken the sIb 
cmly, whereas a stimulation which includes a strong '‘extra” stim- 
ulus element may evoke no reaction whatever since it would also 
alM3!ish the and m the sEr, by "external” inhibition. 

If Pavlov's assumption of the differential action of weak extra 
stimuli on rIr and rEr is accepted, an important implication fol- 
lows at (mm inm the relationship, 

“ sEjt — (Jb. 4 " slid- 

we mtiwe at our tenth corollary: 
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X. If €L reaction potential (sEr) has been partially or wholly 
extinguished, the inclusion of a mild extra stimulus in the condU 
tioned stimulus compound (S) will result in the strengthening of 
the effective reaction potentiality (sEr), 

Blit since sIb constitutes only a portion of the total inhibition 
(Jjj) which weakens sEr, it follows that disinhibition can only par- 
tially restore sEb to the original value of sEb; this leads to our 
eleventh coroliaiy’-: 

XL When an excitatory habit has been set up by means of 
well-distributed reinforcements and has then been extinguished, 
the most effective disinhibitory stimulus possible will never {except 
through “oscillation” — see p. 304 j5^.) enable S to evoke an R with 
as great vigor, certainty, or speed as before the extinction occurred. 


INHIBITION' OF REINFORCEMENT 

It is evident from the above version of the Mowrer-Miller 
hypothesis (Proposition A) that reactive inhibition must be gen- 
erated whenever reactions are evoked, whether reinforcement occurs 
or not. If reinforcement does not occur, the inhibitory potential 
generated by the response may be called extinctive inhibition; the 
inhibition generated if the response is followed by reinforcement 
has been called by Hovland, inhibition of reinforcement {8), from 
the circumstances of its origin, even though the inhibition is pre- 
sumably not dependent upon the reinforcement process. If rein- 
forcement occurs, the consequent increase in habit strength {bHb) 
will so increase the reaction potential {sEr) that when the in- 
hibitory potential {Ir) arising from the successive reactions is 
deducted, the effective reaction potential {^r) will usually be 
superthreshold in amount, i.e., more than enough, given normal 
stimulation and motivation, to evoke the reaction. 

Now, inhibitory potential can be observed only indirectly 
through the failure to occur of some positive reactions which the 
antecedent conditions would otherwise produce. Because of this 
fact and of the usual over-riding effects of reinforcement, it hap- 
pens that inhibition of reinforcement usually does not manifest 
itself in any very dramatic manner such, for example, as in the 
total cessation of reaction evocation so characteristic of experi- 
mental extinction. Probably it is because of these circumstances 
tiiat relatively few investigators have noticed it. As might be ex- 
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pected, it was Pavlov who seems first to have observed and de- 
scribed this phenomenon. He remarks: 

The development of inhibition in the case of conditioned reflexes which 
remain without reinforcement must be considered only as a special in- 
stance of a more general case, since a state of inhibition can develop also 
when the conditioned reflexes are reinforced. The cortical cells under the 
influence of the conditioned stimulus always tend to pass, though some- 
times very slowly, into a state of inhibition, . . . This inhibition is of the 
same character as the internal inhibition which has been d^eribed in 
previous lectures, and it exhibits the same properties of irradiating to 
other cortical elements which are not primarily involved. {16, pp. 234- 
2 ^) 

In the above context Pavlov describes an illustrative experiment 
in which a conditioned reflex in the course of a number of rein- 
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Rki. 64. Giapis lowing the course of the acquisition of the conditioned 
IM refex m & funetioii of the time interval separating the reinforcements. 
CfVan Calvin, 1, as iH:i^ented by Hilgard and Marquis, 7, p. 149,) 

foreemente gjvai rather close together actually diminished to a 
lero strength {se^ Figure 64 ). 

Thus we arrive at our twelfth and thirteenth corollaries: 

XIL Whenever conditioned reactions are evoked, whether rein^ 
md, reacdvm inhibition (Ij^) is generated. 

XnL Im the mm of dosely massed remforcements, the curve 
of of effective mMatory 'j^terdiaL (sFje), particularly 
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in its later stages, mil be distorted by inhibition of reinforcement 
below the learning curve of sEs, in extreme cases showing an actual 
fall with continued practice. 

Calvin reports such a cur^^e (Figure 64), in which the rein- 
forcements occurred at the rate of eighteen times per minute; this 
curve not only ceases to rise after about 25 reinforcements, but 
actually shows a slight fall. 


THE INITIAL RISE IN THE CURVE OF 
EXPERIMENTAL EXTINCTION 

In 1930 Switzer {18) reported a novel form of experimental ex- 
tinction curve, based on the extinction of the conditioned lid reac- 
tion. Instead of falling 
abruptly from the initial un- 
reinforced reaction as in Fig- 
ure 57, the curve showed at 
first a sharp rise in ampli- 
tude of reaction (Figure 65). 

After one or two more unre- 
inforced reactions this initial 
rise was followed by the fall 
usually encountered in the 
extinction process. Hudgins 
{9; 10, p. 439) and others 
have fully verified Switzer's 
original discovery. Hovland 
{8) has reported the out- 
come of an ingenious experi- 
ment which he interprets as 
explaining the phenomenon 
found by Switzer. The habit 
involved was a galvanic skin 
reaction produced by an electric shock which had been conditioned 
to an auditory stimulus. The results of the experiment are shown 
concisely by the four graphs appearing in Figure 66, graphs B and 
C being of special significance. Graph B, which is a curve of ex- 
perimental extinction following massed reinforcements, shows what 
purport to be the Switzer effect; graph C is a curve of experimental 
extinction following a form of distributed reinforcements and shows 
a clo^ approximation to the conventional curve of extinction. Hov- 
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Pig. 65. Graph, showing the Switzer 
phenomenon. Note the^ initial rise in the 
composite curve of experimental extinc- 
tion of a conditioned eyelid reaction. 
(Plotted from pooled values published by 
Switzer, 18 , p. 86.) 
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land interprets these results as indicating (cil that massed rein- 
forcements in considerable numbers leave at their conclusion a rela- 
tively large amount of “inhibition of reinforcement''; (b) that the 
transition from reinforcement to non-reinforcement acts as a disin- 
hibiting agent, producing disinhibition of the accumulated inhibi- 
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Fig. 66. Extinction curves following various conditions of reinforcement. 
R^ults plotted in terms of ratios (in per cent) of responses on successive 
extinction trials to respond on first extinction trial. (A) 8 reinforcements. 
Extinction immediately. (B) 24 reinforcements. Extinction immediately. 
CC) 24 remfon^ments, distributed into 3 groups of 8 each. Rest period of 
^ minutes between groups. Extinction immediately after last group of 
remfor^ments. (D) 24 reinforcements. Extinction 30 minutes after last 
reinfoi^meat. (Reproduced from Hovland, S, p. 431.) 


tioa of reinforc^ent. This accounts for the initial rise shown in 
graph jB. 

The set of assumptions outlined in the preceding pages of 
the pr^ent chapter imply the initial rise of the curve of experi- 
mmtal artinction in the following manner: (a) massed reinforce- 
mmiM generate a relatively large amount of Ir and consequently 
a steoB^ native drive; (b) each pause between reinforcements, 
mm if brief, a slight reaction in the need (for inactivity, 

m ; ic) this m a rrinforcing state of affairs setting up 
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a certain amount of conditioned inhibition (^1^) ; (d) the sudden 
transition from reinforcement to non-reinforcement, on the first 
non-reinforced stimulation, withdraws from the afferent impulses 
which customarily were present at previous reinforcements, the 
stimulus traces of the shock; (e) this change in the make-up of the 
afferent compound, through the principle of afferent interaction (p. 
42 ff.), is sufficient to produce disinhibition of the conditioned 
inhibition of reinforcement; which (/) results in the initial rise in 
the curve of experimental extinction. Thus we arrive at our four- 
teenth corollary; 

XIV. When conditioned reactions are set up by means of 
massed reinforcements, conditioned inhibition is generated which, 
at the outset of extinction, is disinkibited through the change in the 
functioning afferent impulses, with the result that the curve of 
experimental extinction shows an initial rise. 

While Corollary XIV agrees in the main with Hovland^s experi- 
mental findings, there are certain respects in which it does not. 
The greatest single inconsistency is shown in graph D, which rep- 
resents the extinction of a conditioned reaction based on 24 massed 
reinforcements, 30 minutes after the conclusion of the reinforcement 
series. Since disinhibition is assumed to be based on slji, which 
is regarded as a habit, and since ordinary habits do not disinte- 
grate appreciably in 30 minutes (7), there should have been about 
as much sIb to be disinhibited in the case of graph D as in that 
of graph B, which clearly is not the case. The discrepancy is of 
considerable importance, since it points to a serious defect in the 
postulates which generate Corollary XIV. This matter clearly 
needs further intensive experimental investigation, particularly 
from the point of view of the assumed sIb- 

THE LAW' OP LESS WORK 

One of the traditional methods employed in the investigation 
of the gradient of reinforcement (p. 137 ff.) has been to give an 
organism the choice of two paths to the attainment of some sort 
of reinforcing agent, such as food, and study its behavior as rein- 
forcements of the two behavior sequence accumulate. In one of 
the best and most recent investigations of this kind, Grice (d) con- 
cludes in effect that if the temporal factor (upon which the gradi- 
ent of reinforcement is formulated) were completely equalized, 
there would still be a marked preference for the shorter path. In 
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of no practice there will exist a net increased effective reaction 
potentiality (sEr) ; this increase has received the not too appro- 
priate designation of reminiscence. Thus we arrive at our eight- 
eenth, nineteenth, twentieth, and twenty-first corollaries: 

XVIII. In case a simple conditioned reaction is set up to m 
appreciable strength by massed practice and the final reinforce- 
ment is followed by a no-practice period several times as long as 
the interval between reinforcements^ after which the stimulus is 
again delivered, motivation remaining constant, the reaction-evoca- 
tion potentiality of this stimulus unll be greater than it was at the 
termination of the original reinforcement sequence, 

XEX. In the case of simple conditioned reactions, reminiscence 
if plotted as a function of time will approximate a simple growth 
function, 

XX In the case of rote series learned by massed practice, 
reminiscence will rise at first with a negative acceleration, which 
will presently be replaced by a fall (11, pp. 261, 263, 277). 

XXI, The closer the massing of the reinforcements in the origi- 
ncd learning, the greater will he the extent of the reminiscence 
effect 

The phenomenon of reminiscence in the conditioned reflex set 
up by means of massed practice was, as usual, first described by 
Pavlov (16, p. 249). A rather elaborate quantitative investigation 
of the phenomenon with human subjects, incidental to the study 
summarized in Figure 64, has been reported by Calvin (1). He 
found after a no-practice period of 24 hours a mean increase of 
from 5,75 reactions per ten trials to 7.10, a gain of 1.35 points, 
where the reinforcements were three per minute; where the rein- 
forcemente were eighteen per minute he found an increase of from 
2.^ to 7.^ reactions, a gain of 4.95 points. This shows an appre- 
ciable advantage in reminiscence for the more closely massed rein- 

Hie phenomenon of reminiscence in rote learning has been 
known for mmiy yeai^. The last and most precise investigation 
of reniniscence in this form of learning was reported by Ward (Bl), 
^ If we apply Corollary XVIII to any learning situation, it is 
that when reinforcements are separated by time intervals 
of iMKierate length, the greater these intervals, the greater will be 
tile mammt of mgmlmmmB iwovery during the intervals, i.e., tiie 
iwi inhibitory potential at the end of learning, and 

teefom m win be the effective habit strength at the 
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conclusion of the last reinforcement. This is known in learning 
literature as ''the economy of distributed repetitions.’" An excellent 
demonstration of this principle is fimnished by the Calvin study (1) 
Ixjsi referred to. Using human subjects, this investigator condi- 
tioned to a light stimulus the lid-closure reflex originally evoked by 
a shock below the eye. One group of 20 subjects received rein- 
forcements at the rate of three per minute, another group at the 
rate of nine per minute, and a third group at the rate of eighteen 
per minute. The course of the learning in terms of the per cent 
of presentations of the conditioned stimulus evoking the conditioned 
reaction was as indicated in Figure 64. These curves show that 
the rate of learning where three reinforcements were given per 
minute was much faster than “where reinforcements were given more 
closely massed. Many parallel experiments in various fields of 
learning, particularly in the rote learning of nonsense syllables 
{11, p. 127 ff.) , have demonstrated the same type of economy. 

From the preceding considerations we accordingly formulate our 
twenty-second corollary: 

XXII. Within limits, the greater the time interval separating 
the reinjoTcements of learning, the greater mil be the effective ex- 
dtatory potential {b^b) dt the conclusion of the last reinforcement. 

SUMMARY 

The Mowrer-Miller hypothesis states in effect that all responses 
leave behind in the physical structures involved in the evocation, 
a state or substance which acts directly to inhibit the evocation of 
the activity in question. The hypothetical inhibitory condition or 
substance is observable only through its effect upon positive reac- 
tion potentials. This negative action is here called reactive inhibi- 
tion. An increment of reactive inhibition {aIr) is assumed to be 
generated by every repetition of the response {R), whether rein- 
forced or not, and these increments are assumed to accumulate 
except as they spontaneously disintegrate with the passage of time. 
The magnitude of the individual increments, and therefore of the 
rate of accumulation, appears clearly to be in part a positively 
accelerated increasing function of the amount of energy consumed 
by the response. 

Because of the motivational characteristics of reactive inhibit 
tion, or inhibitory potential, it is opposed to reaction potential 
(sEr) rather than to habit (bHr) , as is sometimes supposed. Thus 
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effective reaction potential (sSb), the potential actually available 
for the evocation of action, is the reaction potential less the in- 
hibitory potential. 

Since under ordinary learning conditions response and rein- 
forcement occur in parallel, the strengthening of the habit due to 
reinforcement usually is great enough to over-ride the accumulating 
inhibition. As a consequence, inhibition of reinforcement is only 
detected by special means. In case little or no reinforcement fol- 
lows the reaction evocations, extinctive inhibition soon neutralizes 
the reaction potential, the stimulus gradually ceases to evoke the 
response, and there ensues the state known as experimental extinc- 
tion which thus appears as a secondary or derived phenomenon. 

The Mowrer-Miller hypothesis regards reactive inhibition as 
essentially a need to cease action, i.e., a need for rest; it follows 
that anything which reduces this need should serve as a reinforcing 
state of affairs. Since the cessation of action reduces the afferent 
proprioceptive impulses generated by it in the presence of the 
inhibiting condition, particularly when many responses have gen- 
erated a considerable amount of inhibition, it comes about that the 
cessation of action, rather than action, becomes conditioned to 
whatever stimuli may be present. In this way we find a plausible 
explanation of conditioned inhibition (sIb) and of the stimulus 
generalization of extinction effects. There are a number of indi- 
cations that phenomena analogous to conditioned inhibition and 
stimulus generalization of inhibition occur under conditions of 
ordinary learning reinforcements, though not all the empirical evi- 
dence harmonizes with this a priori expectation. For this reason 
the theory of the origin of sIb must be regarded with somewhat 
more than tie usual amoimt of distrust. 

B^aui^ conditioned inhibition (sIr) is generated as a secondary 
effect from the accumulation of reactive inhibition (1^), it follows 
iimt at least in extinction atuations both Ib and will result. 
A^uming that tihe two summate physiologically, it follows that at 
cimplete ^i^rimental extinction the excitatory potential (sEs) 
will he opposai or neutralized in part by Ib and in part by 
How, Ib di^pates spontaneously through the passage of time, but 
teing a lame habit, pr^umably does not, at least to any great 
Tl© di^pati(^ of Ib will prcduce i^>ontaneous recovery of 
but this will naturally result in only partial 
other hand ihe ^ond inhibitory component in 



INHIBmON-EFFECnVE REACTION POTENTIAL 299 

extinction (s/s) should be subject to external inhibition. Since 
eh is responsible for only a portion of the depression of s^b below 
sEb, disinhibition, which presumably operates only on sIb, also 
should never produce complete recovery. The slight initial rise in 
response rigor when extinction follows massed reinforcements is 
plausibly interpreted as the external inhibition of the conditioned 
inhibition isis) presumably set up during the reinforcement proc- 
ess. The facts, however, are not wholly in harmony with this 
interpretation. 

In case a reaction tendency {bEr) is extinguished a good many 
times, each extinction being performed by massed practice on sepa- 
rate occasions, the gradual accumulation of the relatively perma- 
nent conditioned inhibition implies that the time required for the 
successive extinctions of the reaction tendency should grow less and 
less, the minimum approaching zero as a limit. 

The magnitude of h, and also, presumably, of sIr, generated 
by a given number of response evocations depends upon the amount 
of energy consumption or work (F) involved. This implies that 
of two or more alternative behavior sequences repeatedly executed 
by the organism in the attainment of an ordinary reinforcement 
that sequence will finally come to be chosen which involves the 
less work or the less tissue injury. This is the important “law of 
less work,” which, as pointed out by James {12), accounts for the 
prevalence of “laziness” in the behavior of organisms. 

Because reactive inhibition (/b) dissipates spontaneously 
through the passage of time, it follows that a part of the “inhibi- 
tion of reinforcement” will dissipate during the pauses which occur 
throughout learning by distributed reinforcements or repetitions. 
The less the 1b, pres^ably, the less wiU be the bIr, and so, cer- 
tainly, the less the Ir and consequently the greater will be the 
bEr at the end of the learning process. Thus is explained the 
well-known empirical law of the economy of distributed repetitions 
m learning. If, on the other hand, a considerable number of rein- 
forcements are massed and then a pause occurs, the same principle 
leads to the frequently observed empirical phenomenon of spon- 
taneous recovery of effective reaction potential {sEr) known as 
“reminiscence.” 

In view of the considerations, molar and submolar, put forward 
in the precedmg pages, we now formulate our eighth and ninth 
primary molar principles: 
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POSTULATE 8 

Whenever a reaction (R) is evoked in an organism there is created as 
a result a primary negative drive (D ) ; (a) this has an innate capacity 
(Ir) to inhibit the reaction potentiality (sEr) to that response; (b) the 
amoimt of net inhibition (/r) generated by a sequence of reaction evoca- 
tions is a simple Hnear increasing function of the number of evocations 
(n ) ; and (c) it is a positively accelerated increasing fimction of the work 
(BO involved in the execution of the response; (d) reactive inhibition 
IIr) spontaneously dissipates as a simple negative growth function of 
time (f"). 

POSTULATE 9 

Stimuli (S) closely associated with the cessation of a response (R) 
(a) become conditioned to the inhibition (Ir) associated with the evoca- 
tion of that response, thereby generating conditioned inhibition ; (b) con- 
ditioned inhibitions (s/jr) summate physiologically with reactive inhibi- 
rion (Ir) against the reaction potentiality to a given response as positive 
habit tendencies summate with each other. 


NOTES 

> Mathematical Statement of Postulate 8 
(a) sEr * sEr ““ /a (^) 

(fe) (37) 

(0 (38) 

whew W^F’L (39) 

(<J)‘"7, = J^ (40) 

Mariifiinatical Statement of Postulate 9 

(a) (41) 

( b ) sEr « bEr — Ir. ^'***'^ 


The Mowrer-Jones Graph 




is. 


® li amshct* to produce extinction, 

m pre^ire m gnusa required at each reaction. 
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It may be noted in this connection that the form of the above equation is 
not exactly that shown as (37) above. A transformation of the former equation 
to agree with (37) is. 


116.6 - W 
.3125 


(43) 


where 116.6 » R, and .3125 


Equations Expressing the Laws of the Disintegration of Reactive 
Inhibition (ijj) and of the “Recovery^* of Effective 
Reaction Potential 

The negative growth or decay principle according to which reactive inhibition 
is a^mned to disintegrate (Proposition C) is: 

^ Is X 10-^'". 

Since, 

sEs “ s^s (Is + sis), 

it follows that jsEs after the lapse of time, f ", will be, 

sEs = sEr — (Ir 4* sTs) 

^ sEr- (Ir - 10 -^"" 4 - als), 

from which it follows that sEr will with sufficient time "recover" substantially 
the amount to which it is depressed by Ir, but not the amount to which it is 
depressed hy sis] and the recovery will take place according to the exponential 
or poative growth function of time (i'")- 


Problems Connected with the Conditioning of Inhibitory Potential (^/jj) 

The theory of the generation of sis from Ir presents a number of problems 
which probably cannot be cleared up without a considerable amoxmt of co- 
ordinated research. One major problem may be stated as follows: If the ces- 
sation of contraction can serve as a reinforcing state of affairs, why does this not 
serve to set up habits involving muscular contraction, as well as inhibition? Such 
an implication seems to be perfectly legitimate, and it presents numerous in- 
triguing possibilities. In this connection it is important to recall the nature of the 
gradient of reinforcement (p. 139 ff,), which is to the effect that the process most 
clo^ly preceding the reinforcing state of affairs will be the one most strongly 
ranforced. On this principle, the cessation of the "nocuous" stimulation from a 
muscle 'will reinforce most strongly the cessation, relaxation, or inhibition of the 
act which produced the discharges and distinctly less the active contraction which 
nece^aiily has preceded the relaxation. Thus while some reinforcement of 
excitation leading to positive reaction potential (sEr) would result from the 
<^^tion of “fatigue” stimulations arising from muscular ac'tion, a much greater, 
and therefore a dominating, amoimt of conditioned inhibitory potential (sIr) 
would be generated. 

A second and stall more complex problem may be stated as follows: If the 
<^a1aon, relaxation, or inhibition of action is susceptible to being conditioned to 
^muii, as the phenomenon of conditioned inhibition in conditioned reflexes 
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strongly sugg^ts, why does not an ordinary reinforcing agent, such as the reduc- 
tion of the food need, reinforce inhibitory, quite as much as excitatory, tendencies? 

While this problem requires a much more thorough examination than can 
be given, here, a few suggestions may be made. In the first place, there is much 
evidence that conditioned inhibition is generated in extinction situations, and 
the initial rise of the extinction curve suggests that conditioned inhibition is also 
evolved in ordinary reinforcing situations- It is conceivable that some of this 
conditioned inhibition is set up by means of the mechanism just described. The 
critical question concerns the relative amounts of positive reinforcement {sHs) 
of negative reinforcement (s/je) which will be generated according to ^ 
present set of hypotheses. If inhibitory tendencies were reinforced the same as 
excitatory tendencies, the two might simply neutralize each other, in which case 
no positive efiiective reaction potentials would develop, and ejffective learning 
could not occur. Such an outcome would be implied by the naive suppoaHion 
that reinforcement does not take place until after the cessation of the act rdn- 
forced. For example, in the Skinner procedure the animal does not eat unifl 
after the ce^tion of the muscular contraction which depresses the bar. Acto- 
ally, winle thk is true of primary reinforcement, it is also true that a large porlion 
of the reinforcement in such learning is secondary in nature, and this secondary 
reinfarcement, e.g., the click of the magazine, occurs during the contraction and 
preceding the relaxation. In this connection it is well to recall evidence of the 
occurrence of reinforcement when the reinforcing state of affairs precedes the 
reaction reinforced. Both the work of Thorndike {19 ^ p. 35) and that of Jenkins 
(15, pp- 58a and 72a), however, have shown that the forward wing of this double 
gradient is rdativdiy much lower than the backward one. The difference in the 
strength of remforcement in the two positions should give the conditioned excita- 
tory tendency {sE^) something like the advantage over the conditioned inhibitory 
tendency (gjfi) that experiment shows it to have in fact. 

This quesrion will evideniiy require much further investigation before a 
ecmfiteit de<arion can safely be made r^arding numerous aspects of reactive 
infaiHticm in adaptive bdiavior. Indeed, the present chapter may be considered 
to be largjely an analytical exploration of a very rich field, preliminary to such 
a comndinated research program. It is also to be remembered that this uncer- 
tainty es^tiaHy concerns the sabmolar basis of Postulate 9; from a logical 
point all of tins difficulty is eliminated when we take this proposition as a 

prim^ law, i.e,, as a separate pc^tulate. If, as seems likely. Postulate 9 
m ri^pKHiE^y derivable from other principles of the system, it will become a 
and tihe number erf primary principles will thereby be reduced by one. 
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CHAPTER XVII 


Behavioral Oscillation 

It is an everyday observation that organisms vary in their 
performance even of well-established, habitual acts from occasion 
to occasion and from instant to instant on the same occasion. We 
are able to recall a name at one time but not at another; in shoot- 
ing at a target we ring the beU at one shot, but not at the next; 
and so on. 

When a six-place number is divided by a five-place number 
repeatedly by an automatic calculating machine, the exact identity 
of the quotient obtained on all the occasions is taken for granted; 
if the average person were to perform the same divisions, using 
pencil and paper, he would consider himself lucky if exactly the 
same result were obtained each time. While first-rate calculating 
machines sometimes get out of order and make errors, ordinary 
inorganic mechanisms under the same external conditions show, in 
general, much less variability in behavior than do organisms. In- 
deed, variability, inconsistency, and specific impredictability of 
behavior have long been recognized as the chief molar distinctions 
between organisms and inorganic machines. Clearly a character- 
istic «) fundamental as this must find an important place in any 
adequate theory of organismic behavior. 

KXMIRIMENTAL DEMONSTRATIONS OF BEHAVCOEAL OSGILLATTON 

Even when the strmigth of a reaction potential has become 
stabilized at a value well above the reaction threshold, and the 
conditioned stimulus evokes its reaction with a considerable degree 
of consistency, both the amplitude and the latency of the reaction 
always (ocillate from trial to trial. This is illustrated nicely by 
data from an unpublished study performed in the author’s labora- 
tory by Ruth Hays and Charles B. Woodbury. In this investi- 
gation a hungry albino rat was placed in a Skinnef type of appa- 
ratus (Figure 61, p. 268), the single pressure bar of which was 
provided with a recording dynamometer. In the first part of the 
experiment the apparatus was so set that the animal was required 
to midce a prmure of 21 grams before the food-pellet reward would 
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be deliTered. Four days of training were then given, on each of 
which the animal received 100 reinforcements; on the day following, 
under exactly the same conditions, the animal made the distribu- 




INTEMSITY OF PRESSURES IN GRAMS 

Fig. 67. Two reaction-intensity distributions of a rat in a bar-pressing ex- 
I^riment, designed to illustrate the oscillation of reaction potential (sEb). 
See text for description of experimental procedure. (From an unpublished 
study performed in the author’s laboratory by Rutti Hays and Charles B. 
Woodbury.) 

tion of pressures stown in the upper portion of Figure 67. There 
tie solid circles represent the pressures which were followed by food 
reinforcement, and the hollow circles represent the pressures which 
were too weak to deliver the food pellet. This distribution shows 



PRINCIPLES OF BEHAVIOR 


306 

the phenomenon of intensity variability of a single organism within 
a short period of time under practically identical external stimn- 
lating conditions; the maximum pressure was more than three times 
as great as the weakest, and about twice as great as the minimum 
required to yield food pellet delivery. The upper portion of Fig- 
ure 67 also suggests that the distribution of oscillating reaction 
intensities conforms approximately to the normal 'daw^' of chance. 

Further illustration of the same general tendencies is presented 
by the lower distribution in Figure 67. This shows the variability 
of pressures by the same animal in the same apparatus after four 
more days of 100 reinforcements each with an apparatus adjust- 
ment which required the animal to make a pressure of 38 grams 
before the food pellet would be delivered. Here we find an even 
wider range of variability in pressure intensities than that shown 
in the upper distribution. Again there is a general tendency for 
the distribution to conform to the normal ^^law” of chance, though 
in this case the conformity is not so close as in the upper distribu- 
tion. Several other animals trained like the one whose record has 
been ^ven above, showed exactly similar tendencies in all respects. 
Moreover, Hill (^), in connection with an investigation directed 
at an entirely different objective, secured results on the conditioned 
eyelid reaction which iucidentally showed that both amplitude and 
reaction latency under relatively constant conditions display an 
c^cillation clc^y comparable to that revealed by the rat pressures 
of Figure 67.^ 

Tables publish^ by Thorndike { 10 ) based on a line-drawing 
experiment where learning apparently was effectively precluded by 
human subjects whose eyes were closed, substantiate the results 
yielded by Hays and Woodbury's rats. Thorndike instructed his 
subjecte to draw what seemed to them to be a two-inch line, for 
example. One sub]^ drew 1,697 of these lines. This distribution 
shoTO gr^t variability or c^cillation, with the majority of the 
leigths falling in the central re^on much as shown in Figure 67. 
Thorndike's distribution reveal^, however, a long, thin, as3?m- 
mefcrical tail on the side of the longer lin^. Since the rat data 
mentioned above give no suggestion of this asymmetry, one may 
r^^nably conjecture that it was produced by some factor in the 
sitaation other than the primitive oscillation ten- 
A furtiyar examination of TTiorndike's tabl^ reveals one 

in a piirale and reported here with Dr. Hill's 
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subject who drew a large number each of two-inch, four-inch, and 
six-inch lines. These comparable distributions indicate a progres- 
sive increase in the range of oscillation in the length of line drawn. 
Additional study shows that the range of oscillation in this series 
has an approximately linear relationship to the central tendency 
of the lines actually drawn. Here again there is essential agree- 
ment -with the rat behavior shovm in Figure 67. 

A striking, though indirect, indication of the oscillation or 



Fig. 68. Graphic representation of the empirical per cent of successful 
reactions at the presentation of the cue syllable in the case of the learning of 
265 nonsense syllables in series. The complete learning of each of the 
syllables in question was preceded by six failures of reaction evocation (with 
reinforcement), i.e,, six presentations of the cue syllable which were not 
followed by the correct response. (Keproduced from MatheTTuxticch-DediLctive 
Theory oj Rote Learning, 5, p. 162,) 


yariability of behavior potentiality is presented in the early stages 
of most simple conditioning situations where the experiment is so 
set up that conditioned and unconditioned reactions are clearly 
distinguishable. Under these circumstances it is common for the 
conditioned stimulus to evoke its reaction on one occasion^ yet not 
on the next, in spite of the fact that because of intervening rein- 
forcement the effective habit strength must have been stronger at 
ttte time the failure occurred than it was at the time of the preced- 
ing success (4). 

Notwithstanding the seemingly fortuitous occurrence of reac- 
tions during the early stages in the acquisition of receptor-effector 
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connections; the probability of evocation actually increases con- 
tinuously with the increase of the effective habit strength and, 
since drive is presumably constant^ of effective reaction potential 
This may be shown by pooling the evocation results medi- 
ated by a large number of sHr connections which increase in 
strength at approximately the same rate. There is revealed by 
this procedure not only a progressive increase in the probability 
of reaction evocation but a characteristic sigmoid curve of increase. 
The outcome of such a procedure is shown in Figure 68. The nor- 
mal chance distribution of Figure 67 in the learning situation just 
described would produce exactly the sigmoid learning curve seen in 
Figure 68. 

The above results taken as a whole indicate that the same 
external stimulating conditions, operating through an approximately 
identical habit structure (sHr) ond cl relatively constant drive (D), 
and so an approximately constant effective reaction potential (sEb), 
will evoke distinctly diverse reactions. 

ASYKCHROIOSM OP BEHAVIORAL OSCILLATION 

The phenomena represented in Figure 68 are presumably pro- 
duced by the gradual rise of the effective strength of a habit above 
the reaction thr^hold. In such a situation, if the b^r chances to 
c^cillate so as to be above the reaction threshold at the moment 
of stimulation, overt reaction will occur; if it chances to fall below 
the reaction threshold, the reaction will not occur. Reaction will 
follow ev&ry stimulation only on condition that the momentary 
eff^jtive reaction potential exceeds the reaction threshold by an 
amount greater than the range of oscillation below it. 

There is a related case in which the problem is concerned not 
with Ihe CMimrrence or non-occurrence of a reaction but with which 
one of two or more competing incompatible reactions^ will be 
evoked by a givm stimulus situation when the effective reaction 
pstential of each exceeds the reaction threshold by an amount 
greater ^an the range of oscillation below it. Under such condi- 
tlcms each reaction tendency, if not interfered with by another 
mcompatible one originating in the same stimulus situation, would 
nmliate its imdaou at every stimulation. Suppose, now, that to 
thoK ecuMiitioi^ there be added a further one, namely, that the 

^ ime&Ms are which cannot be ereented at the same 
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habit strength of one of two competing reaction tendencies exceeds 
that of the other, but by an amount less than the range of oscilla- 
tion below it. It follows that if the competing effective reaction 
tendencies both oscillate upward or dovmward at the same time, 
i.e., in synchronism, the one with the strongest habit strength will 
always dominate, its reaction occurring at every stimulation and 
the other reaction not occurring at all. If, however, the oscillations 
of the two reaction tendencies are asynchronous or at least are not 
perfectly correlated, then there may be expected an irregular alter- 
nation, the relative frequency of the occurrence of each reaction 
being an increasing function of the difference between the respective 
habit strengths (see Table 3 and Figure 36, pp. 147 and 150) , 

Experiment reveals the latter state of things rather than the 
former. It follows that in a trial-and-error learning situation of 
this kind, where the strength of one of several competing reaction 
tendencies is steadily increasing with respect to the others, domi- 
nance by this reaction tendency would be attained gradually rather 
than abruptly; this also is a fact. The outcome of such a process, 
where the ultimately dominant habit was at the outset relatively 
weak, is shown in Figure 24 (p. 108) , It is abundantly clear that 
the oscillation of effective habit strength is, to a considerable extent,, 
asynchronous. 

POSSIBLGB STJBMOLAR CAUSES OF BEHAVIORAL OSCILLATION 

There is reason to believe that one of the ultimate physio- 
logical or submolar causes of molar behavioral oscillation lies in 
the variability in the molecular constituents of the nervous system, 
the neurons. Blair and Erlanger have foimd, as the result of ex- 
ceedingly delicate experiments on frog nerves, that neural response 
thresholds and reaction latencies of individual axon fibers vary 
spontaneously from instant to instant. They report: 

When a preparation containing a fiber of outstandiag irritability is 
stimulated with shocks increased in strength by small steps from below 
the fiber’s threshold, there are at first only rare response. To eiieit 
a spike [electrical reaction] with every shock it usually is nectary ta 
increase the strength further by about 2 per cent. 

They report further: 

The time intervemng between successive threshold shocks and the 
resulting conducted axon spikes, or the shock-spike time, under exacfiy 
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comparable conditions is not constant. The range of fluctuation in the 
case of the more irritable fibers may be as great as 0.5 or, but usually k 
about 0.2 to 0.3 a; in the case of the less irritable fibers it may be more 
than 2.4 cr. . . . (Ij pp. 530-531.) 

If to the spontaneous oscillation in irritability of the neural 
conduction elements which mediate behavior, as reported by Blair 
and Erlanger, there be added the random and spontaneous firing 
of the individual neurons throughout the nervous system, which is 
indicated by the experimental evidence reported by Weiss (see p. 
45, above), there would seem to be ample grounds for expecting 
oscillation to be a universal characteristic of organismic be- 
havior (iJf). 

The investigations just cited suggest not only the physiological 
cause of behavior oscillation but its characteristic distribution. 
Mathematicians have shown that the results of the joint action 
of a multitude of independently varying small factors tend to 
distribute themselves according to the so-called Gaussian or “nor- 
mal law'' of probability {2). Thus if 16 coins are tossed simul- 
taneously a large number of times, and the number of heads com- 
ing up at each toss is recorded, it will be found that the most com- 
mon number obtained will tend to be 8 and that the frequency of 
the other numbers of heads per throw tapers off symmetrically as 
zero and 16 heads per throw are approached. In a similar manner 
the spontaneous neural oscillations favoring a reaction stronger 
than average may be thought of on the analogy of the heads of 
the coins, whereas those favoring a reaction weaker than average 
may be considered analogous to the tails of the coins. Accordingly 
it is to be expected that in most cases the two opposing tendencies 
will be about equal in number and so will approximately balance 
each other, yielding a reaction of medium intensity. Occasionally, 
however, a disproportionate number of neural phases favoring 
strong or weak reactions will occur, just as a higher than average 
number of heads or tails in the coin-tossing experiment sometime 
appears. On such occasions an unusually strong or an unusually 
weak reaction will be made, e.g., an imusually long or an unusually 
shcnrt line will be drawn. This, of course, agrees very well with 
the facte of grc^ behavior variability as r^resented in Figure 67, 
thinp equal, then, we may ^ect that the nuignitvde of the 
e€mim€i^m of each mtisde involved in an act mediated hy a recep- 
emmecti4m wUl vary as a fimction of the normal law 
of 
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SOME FURTHER COlSTSEQUElsrCES OP THE SIMIJLTAJTEOXJS 
FORTUITOUS VARIATION OP A VERY LARGE NUMBER 
OP INDEPENDENT PACTORS 

Mathematicians (;0, p. 33 ff.) have determined by appropriate 
methods the outcome of what amounts to a coin-tossing experiment 
in which the number of coins is infinite and an infinite numb^ 
of throws are made. The results of this mathematical procedure 
are conveniently presented in the form of a table, which is ex- 



DEVIATIONS FROM CENTRAL TENDENCY IN a UNITS 


Fig. Graphic repr^ntation of the distribution of normal probability 
to which the intenaties of reactions evoked by repetitions of the same stimii- 
lits are fc^iieved to approach as a first approximation. This figure has be^ 
plotted mainly from columns a, b, a\ and b' of Table 9. Note the bell-dhaped 
cmtour d the distribution. 

tremely imeM for refo^ce, as it ^ves the standard form of di^ 
tiibuticm toward which all atuations involving the action of nu- 
mm«is chance factors approach- An abbreviated adaptation of 
such an a^^mblage of theoretical chance values is shown in Table 9. 

A graphical representation of one aspect of this particular 
pha^ of probability, that of the chance that the joint action of 
toe factors will deviate by a given amount from the central ten- 
m ©vffl in Figure 69. This figure was derived from Table 9 
1^ toe in columns b and 6' as a function of the 

vali^ to a and n'. Figure ^ should be compared with 

F%ure ®r. It will be ncd^ that Figure 69 has a smooto 
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contour, whereas Figure 67 varies by coarse steps; this difference 
is due to the infinite number of infinitesimal factors upon which 
Figure 69 is based. 

The graphical representation of a second aspect of the joint 
action of an infinite number of small independent chance factors 
is given in Figure 70. This figure was also derived from Table 9, 
by plotting the values in col- ' 

nnrns d and d' as a function 
of the values in columns c 
and c'. Figure 70 should be 
compared with the empirical 
graph shown in Figure 68. It 
will be noticed that Figure 
68 resembles Figure 70 in 
that it is markedly sigmoid 
in shape; this fact strongly 
supports the hypothesis that 
a distribution of many inde- 
pendent chance factors pro- 
duced the oscillation which 
occurred during the learning 
of the nonsense syllables, 
from the results of which 
Figure 68 was derived. Fig- 
ure 68 differs markedly from 
Figure 70 in that the upper 
portion of the S-shaped 
curve is much more extended than is the lower portion. This 
presumably is the result of the slower rate of habit strength acqui- 
ation as it approaches its physiological limit (see p. 116). 



DEVIATION FROM 2.5 a PROBABILITY 

Fig. 70. Graphic representation of the 
cumulative per cent of normal probability 
from 2.5 u to +2.5 cr. This figure has 
been plotted from columns c, d, c, and cT 
of Table 9. Because of its shape, this 
representation of the probability function 
is called the ogive. 


THE MOLAR CONCEPTS OF BEHAVIORAL OSCILLATION (sOjj) AND OF 
MOMENTARY EFFECTIVE REACTION POTENTIAL (iSi?) 

At this point of our analysis we may formulate the molar con- 
cept of behavioral oscillation. On the basis of the submolar con- 
aderations presented above it is believed that the variability of 
reaction under seemingly constant conditions is due to the action 
of m c^ciliatory force upon the effective reaction potential (sSi?) , 
Tbk c^ciilatory force will be represented by the symbol qOb- The 
momaitary state of sTJji under the influence of will be called 
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the momentary effective reaction potential and will be represented 
by the symbol 

The evidence at present available indicates that sOe rather 
closely approaches a normal chance distribution. A number of 
other critical matters in the situation are much less clearly evident. 
Among these latter problems is the question: Does the range of 
oscillation vary for a given organism, and if so upon what does 
this variability depend? This problem is particularly acute as 
the value of rises from zero to the reaction threshold. Closely 
related is the question of the direction of the momentary shift of 
gEs under the influence of sOr- For example, the action of ^Or 
mi^t be wholly positive, causing ^r always to oscillate upward; 
or its action might be distributed equally in both the positive and 
the negative direction. The lower portion of Figure 67 together 
with the conditions and facts of the acquisition of skill, which is 
believed to be closely related to the situation which yielded that 
distribution, suggests that sBr may oscillate both upward and 
downward. On the other hand, the shape of certain curves of 
simple trial-and-error learning suggests that the action of rOr on 
rEr may be wholly negative and that its range may be substan- 
tially constant. 

Partly as a means of facilitating exposition, but partly also as 
a means of opening the associated problems to much needed inves- 
tigation, both empirical and theoretical, it has been decided to 
try out here the conceptually rather simple hypothesis suggested 
by the cuiv^e of simple trial-and-error learning, namely that rOr 
u an oscillating inhibitory potentiality y that it acts against effective 
reactum potential that the distribution of sOr conforms to 

the nomal law of chanccy that the mean value of rOr and its range 
are both armtardy cmd that the action of rOr on the rEr as applied 
to the several individual muscles is non-correlated. 

THE REPLETION OF THE “EEACTION-EVOCATIOIsr'' PARADOX 

At this jK>int we find ourselves in possession of principles ade- 
quate to explain the reaction-evocation paradox; i.e., how a stim- 
uli^ may evoke a reaction w^hich has never been conditioned either 
to it or to a siimulos in its stimulus-generalization range. The 
pwAflaoa may, for mnvemence of exposition, be divided into two 
^rte, (1) ttie r^lution of the reaction-evocation paradox as ap- 
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plied to muscular contraction purely as such (to the so-called 
acimcs of Murray, 6, p. 54 ff.), and (2) the resolution of the same 
paradox as applied to acts, i.e., muscular contractions from the 
point of view of their effects upon the environment, particularly 
as these bear on the subsequent reinforcement of the movements 
in question. 

The ultimate effector molar unit in habitual action is believed 
to be the individual muscle. This means not only that the action 
of each muscle in every coordinated movement must be mediated 
by a separate habit, but that every momentary phase of the con- 
traction of every such muscle (since its proprioceptive cues are 
constantly changing) must be mediated by what is in some sense 
a different habit. Viewed in this manner, the contraction intensity 
of a given muscle, as mediated by the results of a given reinforce- 
ment, is the summation of the action of an uninterrupted chain or 
flux of habit, each phase of which is more or less distorted by the 
oscillation function. In this connection it must be recalled (p. 308) 
that the strength of each habit oscillates largely independently of 
all the others. 

Now, nearly all movements are mediated by the coordination 
of sizable muscle groups. If the contraction of one muscle of such 
a group should vary in its intensity, that of the others remaining 
constant, the joint movement produced by the group as a whole 
will inevitably deviate in one respect or dimension from what it 
otherwise would have been. Since the contraction of each muscle 
is mediated by distinct habits, the contraction of all the muscles 
of a group will oscillate independently. Thus coordinated move- 
ment as such may be said to have as many dimensions of variation 
as there are muscles involved in its production. It follows from 
these considerations that infinitely varied movements other than 
those involved in the original conditioning process will inevitably 
be evoked by the impact of the conditioned stimulus, S. In this 
way the reaction-evocation paradox, from the point of view of 
movement as such, finds its resolution. 

From the point of view of action as defined in the first para- . 
graph of this section, it may be pointed out that behavioral oscil- 
lation gives rise to qualitative as well as quantitative differences. 
For example, if a particular muscle in a group mediating the strik- 
ing of a typewriter key acts too weakly, the impression may be too 
faint to be legible; on the other hand, if some other muscle oscil- 
late in the direction of too strong a contraction, the stroke may be 
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diverted to one side and a quite different key will be hit. In such 
a case a qualitatively different act or outcome may be said to have 
resulted from a quantitative deviation in an actone or movemeni^ 
Thus it is clear that very varied acts may result from the rem- 
forcement of an extremely narrow zone of movements. In this 
way there emerges from the analysis the substance of what may be 
called response intensity generalization. Hence the reaction-evo- 
cation paradox, from the point of view of action and goal attam- 
ment, finds its resolution. 

THE INFLtnENCE OF THE OSCILLATION PRINCIPLE ON THE STATUS 
ANB METHODOLOGIES OF THE BEHAVIOR SCIENCES 

It is quite clear from the foregoing that the concrete manifes- 
tation of empirical laws (such as those concerning the acquisition of 
habit strength, p. 102 ff.) is bound to be greatly blurred by be- 
havioral oscillation. Indeed, at first sight it might be thought 
that behavioral oscillation would preclude the possibility of any 
exact behavior science whatever. As a matter of fact, this pessi- 
mistic view is seriously held in certain quarters. 

It must be confessed that behavioral oscillation does impose a 
grave handicap on all the social sciences; generally speaking, it 
precludes the po^ibility of deductively predicting the exact mo- 
mentary behavior of single organisms. However, with an intimate 
knowl^ge of the history of the organism in question and a good 
understanding of the molar laws of behavior, it should be possible 
to pr^ict within the limits imposed by the oscillation factor what 
the subject will do under given conditions. That behavior pre- 
diction has this limitation may be disappointing to some, particu- 
larly to individuals engaged in clinical practice, but there seems 
no from this difficulty; our task as scientists is to report 

what we find, rather than what we or our friends might wish the 
stuation to 

From the point of view of the general molar laws of behavior, 
the situation is far more satisfactory. Because of the tendency of 
lai^ numl^rs of independent chance factors to distribute their 
influenee more or symmetrically (Figure 69) according to the 
or normal law, it com^ about that by various rather 

ia mcMsI life mtuatioiis remfort^ment will follow about 
a considerable range of variability. Were 
eoiy^tuted could hardly survive. 
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simple statistical devices it is possible, when many comparable 
measurements have been made on the same individuaPs behavior 
or on that of a large number of comparable individuals, to isolate 
the central tendencies from the measures of indmdual reactions, 
more or less distorted as they are by the oscillation factor, and thus 
to reveal a close approximation to the laws which are operating. 
Mathematicians have shown that, other things constant, the dis- 
tortions due to such chance factors vary inversely with the square 
root of the number of observations from which the central tendency 
is calculated. Thus the deviation of a mean calculated from 64 
measures will in general differ from the ''true’’ mean, i.e., that which 
might be calculated from an infinite number of comparable meas- 
ures, by only half as much as that calculated from sixteen meas- 
ures; the square root of 64 is 8, the square root of 16 is 4; and 4 
is half as great as 8. 

On the above principle it is evident that complete absence of 
blurring of the mean, due to oscillation and other chance irrelevant 
factors in the situation, is attained only when the number of meas- 
ures becomes infinite; this means that absolutely exact empirical 
laws are never attainable. It follows that the most that can be 
hoped for in the empirical checking of the implications of be- 
havioral laws must be greater or less degrees of approximation; 
and even this approximation can be attained only at the cost of 
great care and vast labor in the massing of data. Indeed, the uni- 
versality of oscillation in organismic behavior is the main reason 
why the social sciences have been forced so extensively to employ 
statistical methods. 

Finally, it may be said that the principle of behavioral oscil- 
lation is to a large extent responsible for the relatively backward 
condition of the social, as compared with the physical, sciences. 

SUMMARY 

Variability, inconsistency, and specific unpredictability of reac- 
tion under seemingly constant conditions are universal character- 
istics of the molar behavior of organisms, attested alike by general 
oteervation and by quantitative experiment. Neuro-physiological 
inv^igations suggest that behavioral oscillation arises from the 
^M>ntaneously variable action of an enormous number of small 
factors (nerve cells), each acting independently to increase or 
dcKjrease the intensity of reactions mediated by receptor-effector 
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connections. Where learned behavior is concerned, the oscillation 
is presumably of the effective reaction potentiality {sEr), In gen- 
eral confirmation of this view, typical experimental determinations 
show that with the effective habit strength and primary drive sub- 
stantially constant, behavior evoked by successive repetitions of 
the same stimulus presents a close approximation to a normal 
probability curve. Moreover, evidence derived from trial-and-error 
learning situations demonstrates in a convincing manner that the 
oscillation associated with each habit tendency is largely, if not 
totally, uncorrelated with that of the others, i.e., that the oscilla- 
tions of different effective habit tendencies are essentially asyn- 
chronous. 

Additional indirect confirmation of the general Gaussian dis- 
tribution of the oscillation fxmction is found in the sigmoid shape 
of certain learning curves when plotted in terms of the per cent 
(or probabili|iy) of reaction evocation. The characteristically more 
protracted upper portion of these curves is presumably due in the 
main, at least, to the progressively slower rate of habit-strength 
acquisition as the physiological limit is approached. 

Two important implications of the oscillation principle may be 
noted. The first yields an explanation for the superficial paradox 
that a stimulus is able to evoke reactions which are more or 1^ 
distinct from any ever conditioned to it or to any other stimulus 
on the same stimulus continuum. The explanation lies in the ten- 
dmiey of the oscillation function to modify the intensity of every 
mu^ular contraction involved in every coordinated reaction; this 
makes the act evoked more or less different from any act involved 
in the original reinforcement. This amounts in effect to response 
intensity generalization, 

A ^cmd implication of the principle of oscillation is that no 
of oi^anismic behavior can ever be expected to mediate 
tiie praise {Hediction of the specific behavior of any organism at 
a pvai instant. However, because of the general regularity of the 
probability distribution in both its symmetrical and skewed forms, 
it will always be po^ble to predict approximately the central ten- 
dmcies of behavior data from either individual organisms or groui^ 
of cMTganims which are imder the influence of approximately the 
antecedmt factors and which share substantially the smic 
furf ^^Dtar-effector equipment. The oscillation factor ex- 
ph&m why all of Hie behavior sciences derive their empirical laws 
fHKi why thdr quantitative investigations nec^- 
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tate the securing of such great numbers of data. Finally, be- 
ha'V’iorai oscillation is believed to be, to a considerable extent, 
responsible for the present relatively backward state of the be- 
havior (social) sciences. 

On the basis of the considerations put forward in the preced- 
ing pages, we now formulate our tenth primary molar principle: 


POSTULATE 10 

Associated with every reaction potential (sEr) there exists an inhibi- 
tory potentiality (sOr) which oscillates in amount from instant to instant 
according to the normal ‘‘law’’ of chance. The amount of this inhibitory 
potentiality associated with the several habits of a given organism at a 
particular instant is uncorrelated, and the amount of diminution in s^r 
from the action of sOr is limited only by the amount of sfie at the time 
available. 

From Postulate 10 there follows Major Corollary III: 

MAJOR COROLLARY m 

Each muscular contraction involved in any increment of habit tendency 
oscillates from instant to instant in the reaction-intensity potenti- 
ality which it mediates, thus producing a kind of response generalization 
in both directions from the response intensity originally reinforced. 

NOTES 

Mathematical Statement of Postulate 10 
The mathematical statement of Postulate 10 is given by the following equation : 

sEr = sEr — sO'Ri (44) 

where, 

sEr = the momentary effective strength of a reaction potential as modi- 
fied by the oscillatory potentiality, sOr, 

where, 

R — sQb when sEr ^ sOr^ 
and, ^ _ 

sO K = sEr when sEr < sOst 


Jsero S sOr ^ 6 <r, o- being a constant, 

and, 

the probability (p) that sOr takes on values between zero and 6 <r is p, 

wh^ 


P = 


NS ^ 


Car-3<r)^ 

2a^ 
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On the Oscillation of Effective Reaction Potentiality 

A conception of behavior oscillation much like the one here elaborated was 
published by the author many years ago W). Spearman {8, p. 323) incorporated 
it into his system of group factors which are supposed to determine human be- 
havior, but otherwise the idea seems not, as yet, to have found acceptance or 
utilization. It would seem that as the theory of behavior grows more adequate 
general, tbig principle in some form must find explicit recognition. 

The equivalent of the principle of behavior oscillation appears as Postulate 15 
of Ma£kmatic<hDedudwe Theory of Rote Learning (5, p. 74). In that postulate, 
however, it was the reaction threshold which was supposed to oscillate, wh®:^ 
in the present system it is reaction potential (sLJs) which is postulated as vaiying. 

Finally, in the present system the principle of oscillation appears to be shifting 
from the status of a x>ostuIate or primary principle to that of a theorem or seconds 
ary principle. Such a change in the status of the principle of oscillation would be, 
of course, in accordancse with the principle of parsimony, which is to the rffect 
that, oth^ t.biTigR equal, the number of assumptions should be as few as posal^ 


The Derivation of Table 9 and Figures 69 and 70 
Table 9 and Figures 69 and 70 are derived from the equation, 


V = 


NS -A 


V27* 




(45) 


where N is the total number of chances (population) involved, y is approximatdy 
the number of these chances falling within a given interval, S is the extent of that 
interval in <r units, x is the distance of the midpoint of that interval from tl]« 
e^tral point of the distiibution of chances, <r is the standard deviation of the 
dktribution of chances, and tt and e are mathematical constants with approximate 
values of 3.1416 and 2.718 reg)ectively (7, p. 13). 

The method of living cdumns b and ¥ of Table 9 (from which col iimii a d 
and are derived) is illustrated by the following example: 

PfMemi To calculate the population of probability or chances falling within 
a TBiige of jS = ,1 O’ (from x — .05 <r to x -f- .05 <r) the midpoint of which is located 
at a distant, x == 1.35 v, from the central tendency of the distribution of chances, 
fee total po^mlation of chances b^ng N = 100, the standard deviation of the 
distilbutioici of being the unit of measurement of the range of the chances, 

Leu, O’ = L ^ibstituting these several values in the equation, we have, 




100 X.l 

V2 X 3.1416 


2X1* 


X 2.718 


-1.822S 

= X 2,718 * 

= ^X2.718-«» 

= a.9883 X .4(m8 
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which agrees to the second decimal place with the entry in column b, opposite 
that of 1.3 <r in column a in Table 9. This entry covers the range from 1.4 <r to 
that of 1.3 0-, the midpoint of which is 1.35, which is taken as the value of x in 

the above computations. 

Behavioral VariabHity as Caused by the Introduction of Unaccustomed 
Components into the Conditioned Stimulus Compound 

It is probable that an appreciable portion of the gross variability in the re- 
sponse of oi^anisms under approximately constant conditions of habit organi- 
jiadon ari^ from the intrusion of a stimulus component, not present when the 
conditioning originally occurred, into the stimulus complex normally evoking 
the ration. By the principle of afferent neural interaction (p. 42), such an 
intrurion would change to a certain extent the afferent impulse produced by the 
originally conditioned stimulus components. This, by the principle of the 
generalization gradient (p. 185), should weaken the resulting reaction, thus 
producing an oscillation in a downward direction. Numerous other variants in 
the stimulus would produce analogous quantitative variations in action evocation. 


REFERENCES 

1 . Blaie, E. a., and ERnANOEE, J. A comparison of the characteristics of 

axons through their individual electric responses. Amer. J. Physzci 
1933, 106, 524-564. 

2. Brown, W., and Thomson, G. H. The essentials of mental measurementm 

London: Cambridge Univ. Press, 1921. 

3. Hnj j, C. J, Retroactive inhibition in conditioned response learning. 

PhD. thesis, 1941, on file Yale Univ. Library. 

4. Hull, C. L. The formation and retention of associations among the in- 

sane. Amer, J, Psychol., 1917, 28, 419-435. 

5. Him, C. L., Hovland, C. L, Ross, R. T., Hall, M., Perkins, D. T., 

Fitch, F. B, Mathemaiico-deductive theory of rote learning. New 
Haven: Yale Univ. Press, 1940. 

6. Murray, H. A. Explorolions in personality. New York: Oxford Univ. 

Pre^, 1938. 

7. Rietz, H. L. Handbook of mathematical statistics. New York: Hough- 

ton Mifflin Co., 1924. 

8. Spkasman, C. The abilities of man. New York: Macmillan Co., 1927. 

9. Thorndike, E. L. Theory of rnentcd and social measurements. New 

York: Teachers College, Columbia Univ., 1916. 

10. Thorndike, E. L. The fundamentals of learning. New York: Teachers 

College, Columbia Univ., 1932. 

11. Weiss, P. Functional properties of isolated spinal cord grafts in larval 

amphibians. Proc. Soc, Exper. Biology and Medicine, 1940, 44, 350 ff. 



CHAPTER XVIII 


The Reaction Threshold and Response Evocation 

It will be recalled that more than once in the preceding pag^, 
when discussing symbolic constructs such as sHb, D, sEr, Ir, and 
we have emphasized the scientific .hazards involved in their 
use. In this connection it has always been made clear that these 
dangers can be obviated only by having the constructs securely 
anchored in two temporal directions: (1) in objectively observable 
and measurable antecedent conditions or events, and (2) in objec- 
tively observable and measurable consequent conditions or evente. 
Up to the present we have satisfied these requirements reasonably 
well in respect to the first or antecedent direction by laying down 
the conditions which culminate in effective reaction potential (A)- 
To this end there have been shown in succession (1) the conditions, 
:and (2) the relevant principles which generate habjt strength 
{bH-r) ; which determine generalized habit strength [sHr) ; which 
show how drive (D) is generated, how habit strength and drive 
combine to produce reaction potential [sEr) , how inhibitory poten- 
tials {1r and bIr) are generated, how these combine with reaction 
potential to produce effective reaction potential and how 

oscillation {rOr) combines with ^r to produce momentary effective 
reaction potential {^r) . 

With the critical construct ^r thus securely anchored on the 
antecedent side, we are at last free to consider the events and 
principles whereby it is anchored on the consequent side. In 
general the latter relationships are somewhat simpler than the 
former. Briefly stated, the consequent anchoring events are reac- 
(R) , i.e., the movements or other activities of the organism. 
For ihe mc^ part these reactions are susceptible of direct observa- 
^on and automatic objective recording. 

At pr^nt the clearest and most dependable single quantitative 
relationship subsisting between qEr and R seems to be that of the 
probability (p) of the occurrmice of the response following stimu- 
Supplanenting the probability-of-reaction-evocation rela- 
(p) are ttree additional functional relationships which 
©ouirilHite to the anchoring of the qEr construct on the 
^de. mte: the latency of the reaction, the resist- 

s' 
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ance of the reaction potential to experimental extinction, and the 
ami^tude of the reaction. In the delineation of the relationship 
of stB to R -we shall accordingly take up first that based on the 
probability of reaction evocation. As a preliminary to this, how- 
ever, it will be necessary to introduce the concept of the reaction 
threshold. 

THE CONCEPT OF THE EEACTION THEESHOU) (bLr) 

As used in neurophysiological, psychological, and behavior the- 
ory and empirical practice, the term threshold implies in general a 
quantum of resistance or inertia which must be overcome by an 
opposing force before the lattor can pass over into action. So 
defined, the threshold concept fits many natural situations to which 
it is not customarily applied. Thus in beginning to drag a heavy 
object over a surface, one must often apply many pounds of trac- 
tion before the weight begins to move perceptibly; if traction is 
gradually increased, there comes a point at which the addition of 
one more ounce to the pull starts the weight moving. The traction 
at this point would be the approximate threshold. 

In an analogous manner, an appreciable weight must be placed 
on the skin before the subject can report its presence with a given 
degree of consistency; a certain amplitude of air vibration must 
reach the ear before the subject can consistently report a sound; a 
certain intensity of light must enter the eye before the subject 
can consistently report a color; the forearm must be moved a cer- 
tain minimal number of units of arc at its joint before the subject 
can consistently report that passive movement has occurred. All 
of these are stimulus thresholds, traditionally supposed to be based 
primarily on the resistance or inertia of the receptor mechanisms. 

In the early days of experimental psychology, as a result of 
the activities of Weber and Fechner (i), much time and energy 
were devoted to the determination of such thresholds. Because the 
early experimental jrsyehologists were chiefly German and because 
of the prevalence of certain philosophical beliefs in Germany at the 
time, particularly those associated vrith metaphysical idealism, 
these minimal reportable stimulations were thought to represent 
a ^antitative relationship between the physical and the psychic; 
this opposed transition of the physical into the psychic was thus 
conceiv^ as a process of the physical stimulus entering the door 
of consciousness, hence the use of the word threshold. Accordingly, 
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the Latin word limen (threshold) is still commonly used as a 
synonym for the threshold in psychophysics. For this reason the 
threshold in psychology is represented by the symbol L; this sym- 
bol, with the addition of a pair of qualifying subscripts, S and JJ, 
is employed in the present work to represent the reaction threshold, 
thus: sLjb. I 

Proceeding to the matter of the quantitative operational defini- 
tion of the threshold, it may be pointed out that in the chronaxie 
determinations of neurophysiology the threshold is defined as that 
minimal electrical current acting on any irritable tissue for an 
indefinite period which will evoke detectable activity, e.g., a dis- 
charge along a nen^e fiber or a movement in a bit of muscle {2, p. 
78 ff.) . In an analogous manner, the reaction threshold {bL^) is 
defined as the minimal effective reaction potential (sFb) which will 
evoke observable reaction; i.e., no reaction will occur unless 

is greater than zero* This difference we shall call the m'per- 
threshold elective reaction 'potentiaL 

imURlCT DEMONSTRATION'S OF THE EMPIRICAL REACTION 
TBOIESHOLD 

In ordinary behavior the reality of an empirical reaction thre^- 
old is demonstrated perhaps most clearly by the fact that in con- 
ditioning and other learning situations several reinforcements are 
fmjuently required before the stimulus will evoke the reaction. 
For example, in the conditioning of lid closure the conditioned reac- 
tion may be recorded as quite distinct from the blink which is 
«®ociat^ with the air puff or whatever the reinforcing stimulus 
happ^ to be. It thus comes about that the question of whether 
or not ttie conditioned stimulus is able to evoke the reaction to 
which it is being conditioned, is readily determined empirically at 
all staps of the learning process without the usual complicaticHi 
of extinction effects. Utilizing this circumstance, Hill (5) found 
in mm experiment that 15 per cent of his subjects showed their first 
OKMiticmai r^cticm at tiie second reinforcing stimulation, 16.7 
at tiiird, 11.7 per cent at the fourth, 8.3 "per cent at the 
p©r <^t at the sixth, smd 3.3 per cent at the seventh stim- 
lialiCiL CMk® tiUnip equal, ihe number of reinforcement® required 
iMmm first reacticm evocation is a quantitative indication of 
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the height of the reaction threshold. Thus those subjects giving 
their fii^t conditioned reaction at the sixth or seventh reinforce- 
ment presumably had higher empirical reaction thresholds than 
did those groups which gave their first conditioned reaction at the 
second or third reinforcement. 

A rather different quantitative illustration of the empirical 
reaction threshold is presented in Figure 5Q (p. 228). A careful 
examination of this figure shows that both fitted curves originate 
at the same point, namely, at a value of four extinction increments 
(aIb) below the point at which response would be evoked, i.e., 
below the empirical reaction threshold. This seemingly paradoxical 
result is distinctly revealing as to the nature of the reaction thresh- 
old; since, as we have seen, reactions will not occur if the excita- 
tory potential is below the reaction threshold, this extinction meas- 
ure of effective excitatory potential cannot function when sEr is 
less than This means that the zero value of any response 

scale falls exactly at the reaction threshold. 

Nevertheless we naturally wish to know how far the reaction 
threshold is above the absolute zero of effective reaction potential, 
or just no sEr at all. Even though the extinction-reaction tech- 
nique cannot directly enter this subthreshold region, it is possible 
to determine the shape of the learning function at numerous other 
points which are well above the reaction threshold, as shown for 
example by the circles in Figure 50. The curve or law of the 
function so determined, when extrapolated backward from the 
points empirically established to where iV" = 0 (and so, presumably, 
yields an approximation to a value impossible of direct 
measurement. Thus the mean empirical reaction thr^hold of rats 
umier the circumstances of Perm’s experiment {8) purports to be 
an mccitatory potential which would be approximately neutralized 
by the first four extinction reactions of the test series. 

Before leaving this subject it must be pointed out that while 
fhe empirical reaction threshold includes the true or inertial thresh- 
old (bLr), the two are not identical. There is good reason to 
I^lieve (see sixth terminal note) that not only the two types of 
empirical reaction thresholds just described, but all mirdmal stirn^ 
idm thresholds in psychophysics are the sum of the true inertial 
threshold plus an artifact of undetermined magnitude which arises 
from the action of the oscillation function {sOr) . As yet no attempt 
has been made to determine the relative magnitude of the two 
factors entering into either the empirical reaction threshold or the 
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^imulus threshold of psychophysics; such a determination, while 
complicated and necessarily indirect, should be possible; when made 
it would not be surprising if the oscillation component is found to 
exceed in magnitude the true or inertial reaction threshold (gL^). 
For the purposes of the present preliminary analysis, the quantita- 
tive separation of the two presumptive components is not neces- 
sary.^ 

THE FUNCTIOlSrAL RELATIONSBCIP OF REACTION-EVOCATION 
PROBABILITY (p) TO THE EFTECTIVE 
REACnON POTENTIAL (sMb) 

It will be recalled that as the result of certain considerations 
put forward in Chapter VIII (p. 102 ff.) we concluded that habit 
strength is a simple positive growth fimction of the number of 
reinforcements. Since reaction potential is a joint multiplicative 
function of habit strength and drive (p. 242), it follows that so 
long as drive remains constant, reaction potential will also closely 
approximate a simple growth function of the number of reinforce- 
ments. In the case of some types of response, such as salivary 
secretion and the galvanic skin reaction, the mean amplitude of 
response in simple learning situations has been found to be a posi- 
tive growth function of the number of reinforcements (p. 103 ff.) ; 
this suggests a very simple (linear) relationship between the am- 
plitude of such learned responses and effective reaction potential 
Cs^je)- 

Many reactions, such as the bar-pressing movements of Perm’s 
rats, approach the all-or-none type, which differs appreciably from 
the galvanic skin reaction, salivary secretion, etc. The all-or-none 
type of reaction introduces into the situation (1) the reaction 
thr^old and (2) the oscillation of habit strength (p. 304 ff.). 
Oiring to the simplicity of the threshold concept and to the fact 
toe (filiation function has already been fairly well established 
by indep^dent investigations, the natural and strategic way to 
^nceive toe progr^ of learning from the response side is found 
in the probability (p) that the impact of the conditioned stimulus 
will evoke toe r^ponse. Accordingly our examination of the quan- 
titotive relationship of effective reaction potential to the four 
wm^m of ite ob^rvable manif station in action will begin with toe 

<£ tfadb pointy see sixth, terminal note. 
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probability of the reaction evocation of the all-or-none type of 
response. 

Let it be assumed that the habit strength of an all-or-none 
type of reaction is reinforced 36 times, with uniform time inter- 
vals between reinforcements great enough to prevent the accumu- 
lation of appreciable amounts of reactive inhibition (/j?), Por 
the sake of simplicity in theoretical calculation, let it further be 
assumed that the drive is constant, that this and the conditions 
of reinforcement are such that the asymptote of effective reaction 



Peg. 71 . Diagram showing the gradual movement of the aone of reaction- 
potential ceciiiation (up-ended bell-shaped areas) across the reaction thr^iold 
(jl®). The upper or growth curve represents as a function of N and is 
plotted from columns 1 and 2 of Table 10. For further explanation me text. 

potential {sEr) with unlimited practice will be 80 wats (p. 134 ff.), 
and that the nature of the reinforcing agent and related conditions 
surrounding the reinforcement proce^ is such that the increment 
of effective reaction potential at each reinforcement will 

be approximately one-twentieth of the difference between tiie effec- 
tive reaction potential just preceding tiiat reinforcement and the 
^wat ^nnptote. The reaction potentials just preceding each 
stimulation {and reinforcement) have been calculated on the abwe 
principle and are prorated in numerical detail in the ^ond column 
of Table 10. These values are representei graphically by the 
upward-arching growth curve shown in Figure 71. 
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Now, since reaction can be evoked only when the effective reac- 
tion potential exceeds the reaction threshold, i.e., when 

sEr > sEr 

and since by hypothesis in the present instance, 

sLr = 10 wats, 

it follows from the values in column 2 of Table 10 that the con- 
ditioned stimulations associated with the first three reinforcemeniB 
cannot evoke the reaction being conditioned. Generalizing, we 
arrive at our first corollary: 

I. In the original learning of reactions of the alUor-none type, 
at least one and often, a number of reinforcements are required be- 
fore the reaction can be evoked by the conditioned stimulus alone, 
the number of reinforcements before reaction evocation being a 
decreasing function of the steepness of slope of the learning curve. 

It does not follow, however, that when 

S^B ^ sIfR 

the impact of the conditioned stimulus will necessarily evoke the 
reaction being conditioned. This uncertainty comes from a num- 
ber of independent considerations. The one which especially con- 
cerns us here is the oscillation principle, discussed at some length 
in an earlier chapter (p. 304 ff.). In order to illustrate the opera- 
tion of the oscillation principle in the present situation, let it be 
ai^umed that the factors determining the oscillation of reaction 
potential are such that when operating at their maximum they are 
sufficient to neutralize any superthreshold effective reaction potei- 
tial up to 50 wats which may be present, but when operating at a 
minimum, their neutralizing effect would be zero. Moreover, in 
ac<x>rdance with considerations put forward in an earlier chapte 
Cp. tile magnitude of these depressing effects presumably 

vari^ from moment to moment in a symmetrical manner about a 
miteal tendency according to the Gaussian ^law” of probability. 
Wm convenience it will be ^sumed that the magnitude of oscillation 
varies over a range of 5 o (standard deviations) ; thus the standard 
(^viaticm of 4^iilati0n (oo) in the supposed situation would have 
a value of SB wats divide by 5, which yields a quotient of 10 waiB. 

WWh valii^ available and with the aid of a suitable table 
im ttae of ssx infinite sample) we may determine 

of reaction evocation after each r^forcement. Hic 
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^TABLE 10 

A Table Showing the Sevekal Steps op the Derivation op the Prob- 
ability OF Reaction Evocation as a Joint Function of the S-eaction 
Threshold, the Strength op Effective Reaction Potential CsEs), and 
the Magnitude of the Oscillation op the Reaction Potential. Strictly 
Speaking, These Values, Particularly Those of the Probability Function 
( p), Presuppose an Unlimited Sample of Homogeneous Behavior. 


Huml^r of 
Preceding 
Reinforce- 
ments 
(V) 

I 

Effective 

Reaction 

Potential 

(sEb) 

ir 

Suj^rthreshold 
Effective 
Reaction 
Potential 
(sEb sLb) 

m 

(sEr — sLr) 
<ro. 

Where 

<ro = 10 

IV 

Probability 
of Reaction 
Evocation (p) 
Derived from 
Table 9 and 
Column IV 

V 

0 

0.00 

.0 

.0 

.0 

1 

4.00 

.0 

.0 

.0 

2 

7.80 

.0 

.0 

.0 

3 

11.41 

1.41 

.141 

.2 

4 

14.84 

4.84 

.484 

1.66 

5 

18.10 

8.10 

.810 

3.84 

6 

21.19 

11.19 

1.119 

7.45 

7 

2413 

14.13 

1.413 

12.94 

8 

26.93 

16.93 

1.693 

20.56 

9 

29.58 

19.58 

1.958 

30.23 

10 

32.10 

22.10 

2.210 

37.45 

11 

34.50 

24.50 

2.450 

49.37 

12 

36.77 

26.77 

2.677 

57.29 

13 


28.93 

2.893 

64.91 

14 

40.99 

30.99 i 

3.099 

71,94 

15 

42.94 

32.94 

3.294 

78.18 

16 

44.79 

34.79 

3.479 

83.50 

17 

46.55 

36.55 

3.655 

87,86 

18 

48,22 

38.22 

3.822 

89.69 

19 

49.81 

39.81 

3.981 

^.68 

20 

51.32 

41.32 

1 4.132 

^.88 

21 

52.76 

42.76 

4.276 

^.76 

22 

54.12 

44.12 

4.412 

96.48 


55.41 

45.41 

4.541 

97.08 

24 

56.64 

46.64 

4.664 

97.97 

25 

57.81 

47.81 

4.781 

^.29 

m 

58.92 

48.92 

4.892 

^.54 

27 

59.97 

49.97 

4.997 

99.00 


60.97 

50.97 

5.097 

100.00 


61.93 

51.93 

5.193 

100.00 

30 

62.83 

52.83 

5.283 

100.00 

31 

63.69 

5a69 

5.^9 

100.00 

n 

64.50 

54.50 

5.450 

100.00 

33 

65.^ 

55.28 

S.52S 

100.00 

34 

66.01 

56.01 

5.601 

100.00 

35 

66.71 

56.71 

5.671 

100.00 

36 

67.38 

57.38 

5.7:^ 

100.00 
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extent to which the upper limit of behavior oscillation exceeds the 
reaction threshold, i.e., the value of the superthreshold effective 
reaction potential (s^r — sLb), is. obtained by merely subtracting 
10 from each of the sEn values; these values are shown in column 
III of Table 10; they are indicated graphically in Figure 71 by 
the extent to which the up-ended, bell-shaped oscillation distribu- 
tions project above the reaction threshold. This portion of the 
several distributions has been shaded to facilitate identification. 
The value of — sLb is next divided by the value of the stand- 
ard deviation of the oscillation (oo), i-©-? by 10; the resulting ratic^ 
are given in column IV of Table 10. 

Finally, the probability of reaction evocation at each reinforce- 
ment may be found by means of columns c and d of Table 9 (p. 
311). For example, the c-value in Table 9 nearest to .141 is .1, 
for which the d, or probability, value is .2 per cent. Similarly, 
the c-value nearest to 1.119 is 1.1, which has a probability or 
d- value of 7.45 per cent, and so on. These probability values are 
given in column V of Table 10. 

The moral of the rather complicated method of calculating p 
just described is that: probability of reaction evocation is a normal 
probability {ogival) function of the superthreshold magnitude of 
effective reaction potential. If this hypothesis, coupled with the 
growth hypothesis of the relation of the number of reinforcements 
to sHb (and so to yields learning curves conforming sub- 

stantially to those observed under corresponding empirical con- 
ditions, all hjrpothes^ involved in the derivation will tend in so far 
to be substantiated. 


THM KlLiCrrOISr-EVOCATION LEARiaNG CURVES IMPLIED BY THl 
THRESHOLD-OSCILLATION HYPOTHESIS 

A graphic repr^entation of the progressive movement across the 
reaction thr^hold of the zone of reaction-potential oscillations as 
a wh<rie is shown in Figure 71. In each normal distribution the 
I^r mit that the shaded area stands to the entire area imder ihe 
bell-diaped curve (column V of Table 10) is the probability (p) 
that an adequate stimulation will evoke the conditioned response. 
Becau^ of ilieir special significance these latter values are repre- 
by a ^p^ate g?^ph, which is shown in Figure 72. Actually 
CTirve cicely parallels an extensive group of empirical learn- 
ing earrm, a which stron^y tends to substantiate the thr^i- 
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old-oscillation hypothesis. A good example of this type of empirical 
learning curve from a simple conditioning situation is that of 
Hilgard and Marquis, shown in Figure 73. An example from a 
much more complicated experimental situation is given above, in 
Figure 68 (p. 307). 

It is highly probable, however, that many learning curves of 
this general character reported in the literature have their shape 
determined in part at least by other factors. Presumptive exam- 
ples of sigmoid learning curves so complicated are seen in Figure 24 



Fig. 72. Graph showing a theoretical probability-of-reaction-evocation 
CTjTve of learning plotted as a function of the number of reinforc«nents. Note 
the i^mewhat distorted sigmoid shape of the learning curve as contrasted 
with the ample growth function shown in Figure 71. The extent of the 
distortion is indicated by the unequal separation of the three vertical lines 
drawn through the curve. 


(p. 108). One of these complicating factors is the quasi-normal 
distribution in the learning difficulty of the various elements which 
are learned, e.g., the several syllables of rote series, the learning 
scores of which are pooled in the plotting of learning curves. An- 
oth^ source of the initial pha^ of positive acceleration someiimas 
found in learning curv^ is the interaction of the (^illation of two 
competing reaction potentials ot^erved imd^ captain conditions of 
i^ple trial-and-error learning (Figure 24 and p. 107 ff.). 

It is quite evident that the characteristics of the curves of 
both Figure 72 and Figure 73 are radically different from th<^ 
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of the simple positive growth fimction to which certain empirical 
learning curves approximately conform (see Figures 21 and 23 
p. 103 ff.) and which we have found reason to believe also paralldfe 
the functional relationship of habit strength to the number of rein- 
forcements. The relationships of the two contrasting types of 
learning curve may perhaps best be understood by comparing Fig- 
ure 72 with the simple positive growth function from which it was 
derived; this is shown as the upper bounding curve in Figure 71 
At a glance the probability-of-reaction-evocation curve is seen to 
be approximately sigmoid or ogival in form. A closer examination 



Fm. 73. An empirical probability-of-evocation type of learning cmve 
^owmg a roughly sigmoid shape. This graph represents the frequency oi 
conditioned lid reactions in dogs on each successive day of conditioning. 
(Adapted frcun Hilgard and Marquis, 4, p. 112.) 

shows, however, that this learning curve differs from the true ogive 
(Figure 70, p. B13} in a number of important respects. In the 
pia^, the probability-of-reaction-evocation function (p) has an 
inilial which is horizontal, standing at zero probability for 

three ^cc^ive stimulations. ITiis phenomenon has already been 
formulated as Corollary L An empirical parallel to it may be 
ol^rved in the initial portions of the curve shown in Figure 73. 

A ^ond and analogous difference lies in the fact that the 
poh^iMty-of.reacrion-evcxiation curve has also a final phase of 
exteifc in winch it is quite horizontal at 100 per cent, even 
^rniiig in the smse of increase in habit strength and 
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reaction potential progresses steadily (Table 10 and Figure 71). 
These considerations lead to the formulation of a second corollary: 

IL In the original simple conditioning or learning of an all-or-^ 
none type of reaction j the maximal level of 100 per cent of reaction 
evocation may occur in the later stages of reinforcement even 
though the reaction potential may steadily increase through con- 
tinned reinforcemenL 

It may be noted in coimection with Corollaries I and II that 
the probability of reaction evocation {p) ceases to be an indicator 
of reaction potentiality both when the zone of oscillation is 
wholly below the reaction threshold and when it is wholly above 
it (Figure 71). For this reason, probability of reaction evocation 
in these extreme ranges is an entirely inadequate indicator either 
of habit strength or of effective reaction potential. Where reactions 
of the all-or-none variety are being learned, there are available, 
however, two supplementary measures throughout the considerable 
upper range of reaction potential which yields a uniform IQO per 
cent of reaction evocations. These are: (1) reaction latency (P), 
and (2) resistance to experimental extinction (11). 

A further examination of Table 10 and Figure 72 shows that 
the probability-of-reaction-evocation learning curve differs from 
a true ogive (Figure 70) in that the steepness of rise of the first- 
half is relatively greater than that of the second half. For example, 
the cur\"e of Figure 72 rises from the last zero probability (after 
two reinforcements) to 50 per cent probability (after about 11 rein- 
forcements) as the result of 11 — 2 or 9 reinforcements. On the 
other hand, it requires about 17 reinforcements (28— 11) to pass 
from the 50 per cent level to the KX) per cent level. The true ogive 
of the normal probability function is, of course, quite symmetrical 
(see Figures 70 and 75) . The as 3 rmmetry of the near-ogival prob- 
ability-of-reaction-evocation learning curve is due to the influence 
of the progressively slower rise of the learning growth function 
(b^b) ns it approaches its asymptote. Thus we arrive at our tiiird 
corollary: 

III. The prohahility-of-reaction-evocation type of leearmmg 
amjCy while roughly resembling the normal ogive ftmction, dearly 
detmtes from it in that as the prdbab^ity of reacikm evocation 
imreases from zero to 100 per cent, the rate of rise of the prob- 
ability of reaction evocation is progressively slower as com^mred 
with the corresponding portion of the normal ogive. 

It is important to note in this coimection that the initial period 
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of positive acceleration of the learning curve is to be expected m 
the present set of hypotheses only when the learning starts from 
absolute zero, or when the rate of learning is relatively slow, or 
when both these conditions obtain. Calculations have been made 
of the values of p. when at the outset (possibly through gen- 
eralization of excitation) is 
supposed to stand at 7 wats 
rather than at zero and when 
the fractional learning incre- 
ment is taken at the relatively 
large value of 1/lQ rather 
than at 1/20 ; these p valu^ 
are represented graphically in 
Figure 74. There it may be 
seen that under the assumd 
conditions a fairly conven- 
tional growth-type curve of 
learning is to be expected, 
with scarcely any suggestion 
of the initial period of positive 
acceleration shown in Figure 
72. It is believed that this is 
the explanation of the failure 
of many reactions of the all- 
or-none type to show the gen- 
erally ogival form of learning 
curve. In this connection it 
may be recalled that Thur- 
stone foimd the sigmoid form of learning curve only when the 
material to be learned was difficult {10), 

The preceding bit of analysis accordingly brings us to our 
fourth <x>rollary: 

TV. If the Uaming of an all-or-none type of reaction sets out 
with an sEb vcdue appreciably above zero, and if the rate of learn- 
wig is relatively rapid, the initial period of positive acceleration 
clmracteristic of this type of learning will not appear in the prob- 
od^Mty-of-reactionr-evocation cwrve of learning. 

As a brief summary of tiie foregoing examination of the rela- 
of tike theoretical construct, effective reaction potential 
to reactkm evocation as based on the threshold-oscillation 
we pr^ent in Figure 75, in comparable graphic form, 



NUMBER OF REINFORCEMENTS (N) 


Fig. 74. A theoretical probability-of- 
reaction type of learning curve resulting 
from the a^wmption of a rapid rate of 
habit acquirition together with a sub- 
i^antial reaction potential from general- 
ization at the outset of the reinforce- 
ments. See text for details. 
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the two functions which combine to produce the sigmoid theoretical 
cun-e of learning shown in Figure 72. The upper portion of Fig- 
^ 75 represents s^s as the familiar simple positive growth fune- 



SVPERTHRESHOLO reaction potential 

Fig. 75. Graph showing the analysis of the theoretical learning ciirve_of 
Kgure 72 into two components connected by the mediatii^ construct sEs- 
The upper curve diows sEs as a growth function of N, and the lower curve 
shows p as an ogival function of ^s-sLs. 

tion of the number of reinforcements (iV). The lower portion of 
the figure shows the critical second component, the probability of 
reaction evocation, as an ogival function of i.e., of the effective 
reaction potential less the reaction threshold. This latter type of 
function is the special concern of the pr^ent chapter, since it 
serves to anchor the construct gEs to an oteervable consequent 
event. 
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REAOnON lATENCr (atn) AS A FTTNOTtOIT OF EraECTrvE 
EEACTION POTENTIAL (s^b) 

^^ojeeding with our systematic examination of the relationshin 
f effective reaction potential to quantitatively observable respond 



^ 10 15 20 

NUMBER OF REINFORCEMENTS (N) 


R£ACT?Sn POTE^fAL IN WATS 

Sisiley's theoretical components of one d 

ruction latency (siM). The (^^igure 22) which is plotted in terms of 

^rength (and ^ of nlnrt^ component is the familiar curve of hdt^ 
of reinfort^ments (M) Tli i ^ ^ simple growth function of the numb^ 
The broken line' «»n'P®ent represents sts as a function of 

Oian about 24 wats retire.--- tow^ infinity as sEs values grow 

thre^ whieh^^ 

relationship of sfg 
CFr Btm, the time mterveniug between the begin- 
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Ding of the stimulus and the beginning of the response. Unfortu- 
nately, the quantitative aspects of this function are complicated by 
the conditions of reinforcement in a manner not yet fully deter- 
Diined, It is well known, for example, that given suitable condi- 
tions of learning, organisms can be trained to react after a consid- 
erable range of predetermined delays. However, if the promptness 
of the reinforcement is dependent upon the promptness of the reac- 
tion, there is in general an inverse functional relationship of the 
number of reinforcements to the reaction latency. The following 
analysis assumes these latter learning conditions. 

This relationship is perhaps best represented by means of a 
graphic analysis of the Smiley (9) reaction-latency learning curve 
shown in Figure 22. Such an analysis is presented in Figure 76 
in a manner parallel to that of the purely theoretical probability- 
of-evocation fimction shown in Figure 75. Thus in the upper por- 
tion of Figure 76 there appears the familiar positive growth function 
representing the relationship of the effective reaction potential to 
the number of reinforcements. In the lower portion of the figure 
we note the functional relationship which is of present interest, 
that of sts to jsSb. a glance at this portion of the figure shows 
that sts is a negatively accelerated decreasing junction of 
where is greater than about ^4 watSj the empirical reaction 
threshold, 

RESISTANCE TO EXPEEIIMENTAL EXTINCTION (n) AS A FUNCTION 
OF EFFECTIVE REACTION POTBNTIAIi 

Continuing our systematic exammation of the relationship of 
effective reaction potential to quantitatively observable response 
phenomena, we attempt as our third task the determination of the 
functional relationship of s'Eb to the number of unreinforced reac- 
tion evocations (n) required to extinguish a reaction potential to 
a ^ven degree of impotence, say to three successive stimulations 
which fail to evoke observable reaction. Here, much as in the case 
of reaction latency, the quantitative aspects of tiie probl^ are 
complicated by the conditions of reinforcement. It has hem shown 
by Humphreys (7), for example, that the cour^ of extinction is 
rather different when the reinforcements of the original learning 
have beai accompanied by a considerable number of non-reinforc^ 
stimulations. The pi^ent analysis is accordingly ^mewhat ten- 
tative, as was that concerned with It proceeds <m tibe a^ump- 
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tion that the conditions of reinforcement are constant throughout 
and that they are uncomplicated by non-reinforcement or other 
disturbing factors. 

This relationship may perhaps best be illustrated by means of 




Fig. 77. Graphic representation of the theoretical components of 
empiricai learning curve (Figure 23) which is plotted in terms of the numb^ 
erf iinreinforced reaction evocations (n) required to produce experimental 
extiac^on. The upper component is the usual curve of habit strength (and 
m of plotted as a simple growth function of the number of reinforce 
meiste iN), The lower <x)mponent represents as a simple Hnear funetkm 
€>1 JfjB, 

a graphic analyms of Williams^ learning curve shown in Figure 23. 
Tlife eurve, it wiU be recalled, represents the number of unrein- 
(n) require to produce a given degree of extinc- 
ttoB m a fimctaon of the number of reinforcements, drive remainh^ 
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constant. The theoretical analysis of this empirical fxmction into 
two components in parallel with the analysis presented in Figures 
75 and 76, is shown in Figure 77. As in Figures 75 and 76, the 
upper portion of this figure represents reaction potential as the 
familiar positive growth function of the number of reinforcements. 
The lower curve shows as the second component that the number 
oj extinction reactions (n) is a simple increasing linear function 
of the reaction potential (s^r) . 

It may be noted incidentally that, according to this function, n 
becomes negative when sEr has values less than about 8 wats. This, 
of course, is an expression of the reaction threshold emphasized in 
connection with the interpretation of Figure 50. 

INTENSITY OR AMPLITUDE OF EEACJTrON (a) AS A FUNCTION OP 
EFFECTIVE REACTION POTENTIAL { s ^ r ) 

As our fourth and concluding analysis of the relationship of 
effective reaction potential to quantitatively observable response 
phenomena, we shall consider that of sEr to a variety of reaction 
which is not of the all-or-none type. We have chosen for this pur- 
pose Hovland’s empirical curve of the acquisition of a conditioned 
galvanic skin reaction, shown as Figure 21 (p. 103). In connection 
with this analysis it must be pointed out, much as in the cases of 
reaction latency and of extinction, that the function may possibly 
be complicated by variations in the conditions of the original 
acquisition of the reaction potential. It is known (p. 305 ff.) that 
striated-muscle reaction may easily be trained to a particular am- 
plitude by special conditions of reinforcement. However, since 
no evidence, experimental or observational, has been found of such 
a tendency in the case of either the galvanic skin reaction or 
salivary secretion, distorting complications of the functional rela- 
tionship of A to sEr, where the autonomic nervous system is pri- 
marily involved, seem rather imlikely. At all events, it is assumed 
in the following analysis that no such complications are involved. 

The components into which this learning curve breaks up are 
presented in Figure 78 in a manner exactly parallel to Figures 75, 
76, and 77. The upper portion of Figure 78 shows, as usual, the 
effective reaction potential {bEr) as a simple positive growtii func- 
tion of the number of reinforcements (W). The lower porticm of 
the figure, our chief concern here, reveals that the amplitude of the 
conditioned galvanic skin reaction is a mmple linear increasing 



340 PRINCIPLES OF BEHAVIOR 

function of the elective reaction potential. It thus resembles very 
closely the relationship of aEs to the number of unreinforced reac- 
tion evocations (n) required to produce extinction. However, a 
striking difference is also to be noted: whereas in Figure 77 the 
straight line originates below the reaction threshold, in Figure 78 
it originates an appreciable distance above it; this reflects tiie 




Fh 3. 78. Hie analyiM of a. curve of the conditioning of the galvanic 
i^in reacti<Hi {Figure 21) into two components connected by the theoretical 
(xnistroet sEm. 

well-known fact that previous to specific conditioning, almost any 
stimulus win evoke the galvanic skin reaction. Stated in another 
way, this means that at the outset of the learning process here 
under cormideration (Figure 21), tiie reaction tendency is wdl 
above the reaction thr^old, just as that shown in Figure 50 is 
^jpreciably bdow it. 
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the competition of simultaneous incompatible reaction 

POTENTIALS 

There remains to be considered one more important matter 
before the relationship between and reaction is formally com- 
plete. It will be recalled that when discussing the reaction thresh- 
old we pointed out above that no reaction will be evoked unless 
sEb is greater than sLb- It must now be noted that reaction will 
not inevitably occur when sEb exceeds sLb, or even when the mo- 
mentary effective reaction potential (sEb) is greater than sLb- 
This is because there are frequently encountered situations in which 
the stimulus complex impinging on the organism may simultane- 
ously give rise to two or more incompatible reaction potentials. 
Examples of such reaction tendencies would be opening and clos- 
ing the eyelids, extending and flexing the arm or leg, or speaking 
almost any two words of a language. It is obvious that in such 
a situation the momentarily weaker of two competing reaction 
potentials cannot possibly mediate its reaction. 

Whether in such cases the reaction potential of the dominant 
tendency is completely brought to bear in the evocation of the 
reaction, or whether it suffers some diminution resembling that 
long known as associative inhibition, is not known. Certain ob- 
servations suggest that the latter supposition is the true one. At 
all events, the experimental evidence has led some investigators 
{B, p. 206 ) to the view that the interference of incompatible reac- 
tion potentials may be mutual and that this generates an inhibitory 
potential which behaves in many, if not all, respects as does that 
generated by experimental extinction. Unfortunately the dynamics 
of this very common situation have not been sufficiently invi^i- 
gated to warrant an attempt at a detailed quantitative statement, 
particularly as to possible indirect inhibitory effects. 

Ignoring for the present, then, possible generalized inhibitory 
tendencies which may result from tiie competition of incompatible 
reaction potentials, we state as a first approximation that if two or 
more swperthreshold reaction potentials exist in an orgcmism at the 
same instant, only the reaction of that one whose (^dUation imlue 
at the moment is greatest will he evoked. Thus is formally con- 
cluded the tads of anchoring the construct to objectively ob- 
^rvable behavior on the consequent side. 
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SUMMARY 

The pivotal theoretical construct of the present system is that 
of the effective reaction potential (sEr). An attempt has been 
made in the preceding chapters to anchor this in a secure and quan- 
titative manner to antecedent observable conditions of habit for- 
mation, of motivation, and of stimulation immediately preceding 
reaction evocation. The task of the present chapter has been to 
anchor it in a parallel manner on the posterior or consequent side. 
The observable phenomena available for this purpose are found in 
four aspects of reaction evocation: (!) the probability (p) of reac- 
tion evocation; (2) reaction latency (stR) ; (3) resistance to experi- 
mental extinction (n) ; and (4) in the case of autonomically medi- 
ated reactions, reaction amplitude (A) . This means that a success- 
ful performance of our task involves the quantitative determination 
of the functional relationship of sEr to p, stit, n, and A, respec- 
tively. 

Of the four relationships, the one at present offering the best 
prosi>ects of a successful conclusion is that involving the probability 
of reaction evocation, p. This is in part because the various addi- 
tional consixucts necessarily involved are revealed in a fairly obvi- 
ous manner by relatively independent considerations. The con- 
structs in question are (1) the reaction threshold (sLr) and (2) the 
downward oscillation (aO^) to which effective reaction potential 
(sEr) is believed to be subject. 

So long as the maximum reaction potential lies below the reac- 
tion threshold, no activity can be evoked. However, as this maxi- 
mum the reaction threshold in the more simple learning 

situations, the probability of reaction evocation increases pro- 
gr^ively until the zone of oscillation is wholly above the reaction 
threshold, when the probability of reaction will be IQQ per c^t, 
or Since it may require several reinforcements to raise 

to a value exceeding that of rEr, it often happens that there is 
an initial region of uniformly zero reaction probability in learning 
curv^. Similarly, the zone of maximum oscillatory interference 
with rEr (minimum value of rEr} may have passed above the 
re^tion thr^hold before the limit of habit acquisition afforded by 
the ^rndhaons of reinforcement is reached; this may bring about 
a more protracted terminal period of uniformly j^fect 

probability (100 per cent) in this tjqie of learning curve. 
Tinally, owing to the pr^umably normal distribution of the oscil- 
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lation function, its passage across the reaction threshold yields 
during the learning process a probability-of~reaction- evocation 
learning curve which possesses when complete a distinctly skewed 
ogival form. The general agreement of learning curves secured 
under corresponding empirical conditions, with the three theoreti- 
cal implications just listed, gives considerable additional substan- 
tiation to the belief in the general soundness of the various hy- 
potheses involved. 

One of the results emerging from the above analysis is a further 
confirmation of the critical hypothesis that (and so gWjt, in 
case D is constant) is a simple positive growth function of the 
number of reinforcements (N). This last consideration is of stra- 
tegic importance because it enables us to determine in an indirect 
manner the presumptive functional relationship of each of the three 
remaining response phenomena to the standard or maximal value 
of effective reaction potential It happens that the sample 

empirical learning curve based on each of the three response aspects 
is expressible to a reasonably close approximation by equations, 
all of which contain an explicit positive growth fimction of N, The 
replacement of this expression by its equivalent, sEs, leav^ the 
particular response phenomenon stated in terms of sEs- This type 
of analysis when applied to the learning curves based on pr^m- 
ably representative sets of empirical data indicates as a first ap- 
proximation that: reaction latency (stj?) is a negatively accelerated 
decreasing fimction of effective reaction potential (sEr) ; that 
r^istance to experimental extinction (n) is an increasing linear 
function of effective reaction potential {^r) ; and lhat amplitude 
of reaction (mediated by the autonomic nervous system) is an in- 
creasing linear function of effective reaction potential (s^r). 

However, situations frequently occur where the stimulus com- 
plex impinging on the oiganism at a given moment is such as to 
give rise to two or more incompatible reaction potentials. Ignoring 
for the present the possibility of indirect inhibitory effects^ we 
may say as a first approximation that in such situations the reac- 
tion potential which is strong^- will dominate the others by evok- 
ing its reaction. Thus is formally completed the anchoring of the 
constructs ^r and to objectively ote^able ph^cnnena on 
the cons^uent side. 

Gmeralizing on the considerations elalorated above, we now 
formulate six additional primary principles: 
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POSTULATE 11 

Tlie momentary effective reaction potential (s^r) must exceed the 
reaction threshold (sLr) before a stimulus (S) will evoke a ^ven re- 
action (R). 

POSTULATE 12 

Other things equal, the probability (p) of striated-muscle reaction evo- 
cation is a normal probability (ogiv^) function of the extent to which 
the effective reaction potential (sEr) exceeds the reaction threshold 
(sLje). 

POSTULATE 13 

Other things equal, the latency (sts) of a stimulus evoking a striated- 
musde reaction is a negatively accelerated decreasing monotonic function 
of the momentary effective reaction potential provided the latter 

exceeds the reaction threshold (sLr). 


POSTULATE 14 

Other things equal, the mean number of unreinforced striated-muscle 
reaction evocations (n) required to produce experimental extinction is a 
simple linear increasing fimction of the effective reaction potential {sEr) 
provided the latter at the outset exceeds the reaction threshold (sLr). 

POSTULATE 15 

X>ther things equal, the amplitude (A) of responses mediated by the 
autonomic nervous system, is a simple linear increasing function of die 

mmnentary effective reaction potential (sEr). 

POSTULATE 16 

When the reac^mn potentials (sEr) to two or more incompatible re- 
actkms (R) occur in an organism at the same time, only the reaction 
whose mommtary ^ective reaction potential (sEr) is greatest will be 
evoked. 

PcMtulate 16 <x)mpletes our statement of primary principles. 


NOTE^ 

Mattenatical Sfettement of Postulate 11 


( 46 ) 
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Mathematical Statement of Postulate 12 


= 0 when &Ee sLr 

s^R + 2.5 <r 


■|[W2.5)-f(< 


)] 


1 when sEr ^ sLr + 5 <r, 
where <r is the standard deviation of sOr, 
and 

^(<r) = ^(0 dt, 

%J — oo 

whffe 4>(t) is the standard probability function. 


Mathematical Statement of Postulate 13 

a' 

“ fits'’ 

where a' and 5' are positive empirical constants. 


m 


(48) 


Mathematical Statement of Postulate 14 

n^c'aER--f\ (49) 

where d and / are empirical constants. 

Mathematical Statement of Postulate 15 

A = h'^R - 1', (50) 

h' and i' are empirical constants. 

Mathematical Statement of Postulate 16 
(Mw)-v>€8Bb^: (Sx) (3y)’y€S: > sts: 3 = (51) 

The Oscillation Component of the Empirical Reaction Threshold 

This component would arise in a ample rdnforcement determination of the 
reaction threshold (as in the experiment cited from Hill, 5) in the following 
m anne r: As the (maximum) value of sEr is passing the ruction threshold, it k 
in the high^t d^ree improbable that s^r will be^^res^ to the minimal (searo 
CHT near-zero) degree of oscillation the first time sEr is tested for reaction f^ure^ 
For sample, on the assumption that tiie maximum range of osdOiliitimi Is 5 
tte chances are ^ to 5 against sEr exceeding the reaction threshoM evm wh^ 
standard or value of rEr has risen above ihd reacticm thre^mM by 

as much as one-fifth its range of osdllaticm, Le., nearly as mudx as tiiat tiiown 
at = 6 in Kgure 71. 'Ihis means, of course tiiat with a limited nunJb^ 
triaJ^at thk level of leMiung tii^re wiH inevitdWy be an appreci^ie valim 
of sEr previ<ms to tl^ first reactimi evocation. It m evideit that the extent to 
which tins mean value of sSr at the fin^ reaction evocatiim exceecfe zero mu^ be 
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included in the magnitude of the empirical reaction threshold as indicated dxyve 
from conditioning data. Indeed, it would produce a considerable superfidal 
indication of an empirical reaction threshold even though no inertial reaction 
threshold (sLr) were present. 

By exactly analogous reasoning it is easy to show that a parallel effect wouM 
result from the determination of the empirical reaction threshold by the poding 
of results secured in the ordinary extinction of all-or-none reactions. 


The Equations From Which the Curves in Figure 75 Were Plotted 
The equation of the upper or growth fimction is, 

== 100 (1 - 10--02226iV^. 

The equation expressing the functional dependence of p upon is, 

r = 0 if rBr sLr 
^ = L012^[^(2.5) - ^(3.5 - 
1 if sEr ^ sL/r + 5 O’, 

where <r is the standard deviation of the oscillation function ( sOr ), rLr is taken 
as 10 wats, and 5 <r is ta^n as 50 wats. This is the first of the four critical eguaiions 
anchoring the construct sEr on the consequent side to ohjectivdy observable fihenomem, 
Ihe second of the three right-hand members of the above expression represents 
the normal <^ve function. 


The Equations From Which the Curves in Figure 76 Were Plotted 

The equation fitted to one of Simleyh composite empirical learning curves 
(Figure 22 plotted in terms of reaction latency) {sti^ is, 

■ [100 (1 - 10-'122iV)].4‘ 

In^)ection of the right-hand member of this equation reveals the expr^rimi 
(100(1 10 “*^ 22 Ar) is the familiar expression of ^r as a positive growtii 

function of This circumstance enables us directly to write the equation, 

sis = 100 (1 - 10- i22i^, 

from which was plotted the cmve in the upper portion of Figure 76. 

Bejdaeing 100 (1 — equation by rEr, we have as the 

emnpcmeit, 


ekt 


5.11 

sis"' 


im®a which was plotted the lowo* curve of Figure 76. This is the second of the 
fomr cri^cml ^otiom cmchoring the construct rEr on the consequent side to obfedu^ 
^^momen^ It ^Hmld be noted that, strictly speaking, thm equation 
does hoM for sSr values bdow about 24 wats, which in this case marks tiie 
rm^m ttaeshoidL 
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The Equations From Which the Curves of Figure 77 Were Plotted 


The equation fitted to Williams^ and Perm's empirical lea rning curve {Figure 

m is, 


n = .66 [100 (1 - - 4. 


By inspection, the bracket in the right-hand member of this equation is a positive 
growth function of N and presumably represents We accordingly may 

write the equation, 

sSb = 100 (1 - 10“--oi8^. 


It is from this equation that the upper curve of Figure 77 is plotted. 

Replacing the bracket of the original fitted equation by we may write 
from equation 43, 

fi — (E — W){8Er — bLr )^ 
c 

where B = 116.6, W = 42.5, c = 12.5, and E = 1, 

as the remaining component of the empirical equation. It is from this equation 
that the lower curve of Figure 77 has been plotted. This is the third of the fotar 
eguations anchoring the critical construct^ sBr, on ihe conseguerd side io olgeo 
twdy dbservaJble 'phenomeria. 

It should be noted in this connection that certain evidence, such as that repre- 
sented by Figure 57 and Table 6, appear^superficially to be in conflict with the 
linear relationship of n as a function of represented by the above equation 
and in Figure 77. It is pebble that the apparent disagreement is due to the 
fact that both Table 6 and Figure 57 are based on processes under autonomic 
control, whereas Figure 77 presupposes striated muscle response. Because of 
the seeming inconsistency in the evidence, Postulate 14 and the above equation 
expressing it cannot be accepted without reservation until definite confirmatory 
evidence becomes available. 


The Equations From Which the Curves of Figure 78 Were Plotted 
The equation fitted to Hovland's empirical learning curve data (Figure 21) is, 
.141 [100 (1 - 10--053Ar)] + 3.1. 

By inspection, the bracket "may be' seen to enclose a simple positive growth func- 
tion of iV. Accordingly we write directly the equation, - 

Mr - 100 (1 - 

Enlacing the bracket of the orignal equation by we have as the i^coiai 
<x>mponent, 

A = MIMr + 3.1. 

It is from tins tiiat the lower curve of Figure 87 has be^ plotted. TMs u the 
lad of idke f(mr equoMons cmchoring the criUcol cond^met on the ecmsegtieal steto 
to objedwdy ol^mcMe j^ienomenoL 
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CHAPTER XIX 


The Patterning of Stimulus Compounds 

In our account of the “law of reinforcement” (p. 71 ff.) it was 
pointed out with care that reactions are conditioned to the central 
afferent impulses (s) set in motion by the action of stimulus ener- 
gies (S) upon receptors. Because of the approximate one-to-one 
relationship between S and s, in many instances the influence of 
the principle of afferent neural interaction (p. 42 ff.) was largely 
ignored in the interest of. expository simplicity. For similar rea- 
sons the distinction between sBr and sHr was not stressed, and 
the habit strengths to the evocation of a given reaction associated 
with different stimulus elements or aggregates occurring together 
were reported in the main, though not exclusively (p. 216 ff.), as 
summating by a kind of diminishing returns principle (p. 223 ff.). 
Moreover, a detailed account of the role of afferent neural inter- 
action, particularly in the patterning of stimulus compounds, could 
not be given until the reader had been familiarized with the major 
phenomena of conditioned inhibition (p. 282 ff.), of its generaliza- 
tion (p. 283 ff.), and of oscillation (p. 304 ff.). Now that all of our 
primary principles have been set forth, we may present in a little 
detail some further important implications of the principle of 
afferent neural interaction. 

For reasons presently to be disclosed (p. 374) there are many 
situations in which organisms would have about as good chance 
of survival if they responded as if their reactions were conditioned 
to stimulus elements (5) as they woiild if their reactions were 
conditioned to central afferent impulses (s) ; moreover, habits do 
in fact summate (p. 209 ff.). TWe are, however, innumerable 
situations in which the response must be made to a certain com- 
bination or configuration of circumstances (B, p. 503) and in which 
that response would not be reinforced if given to any single cir- 
cumstance as represented by isolated stimulus elements or aggre- 
gates ^ or in any combination. For example, a red light suspended 

^Actually, of cxjurse, stimulus elements probably never literally occur 
alone; a stimulus element or aggregate is said to occur “alone” ^en its 
onset occurs alone, the onset of the other stimuli having occurred earlier or 
later. A stimulus element is a stimulus energy which activates a an^ re- 

349 
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over a street intersection will cause a man to halt when his goal 
would lead him to cross the street, but a red light in a drugstore 
window will not cause him even to slow his pace; he responds not 
to the red light alone, but to it as a component in a particular 
combination of stimulus aggregates. Now, as a rule, learning to 
react, or not to react, to a stimulus combination as distinguished 
from its components is more difficult than the simple conditioning 
of a reaction to a stimulus compound. This learning to respond to 
stimulus combinations or configurations, as such, we shall call the 
'patterning of the stimulus compound in question. 

For purposes of convenience we shall distinguish two main forms 
of stimulus patterning. The one to be considered first, because con- 
ceptually the simpler, is that in which *the onset of the several 
stimulus energies involved takes place at the same time; this is 
called simvltaneom stimulus patterning. The second form is that 
in which the onset of the several stimulus energies occurs succes- 
sively; this is called temporal stimulus patterning. In some cas^ 
the response will be reinforced when it follows the presentation of 
the stimulus combination; this results in positive patterning. In 
other cases, reinforcement will occur only when response follows 
the separate presentation of the stimulus components; this results 
in an extinction of the tendency for the compound ‘to evoke the 
reaction and is therefore called negative patterning. 

SOME EXPERIMENTAL EXAMPLES OF SIMTJLTANEOTJS STXAIULXTS 

PATTERNING 

The first experimental attack on the problems of the pattern- 
ing of stimulus compounds began in the laboratory of Pavlov, 
though he did not use this expression in connection with the experi- 
ments. Unfortimately he gives no example of simultaneous pat- 
terning. He dc^, however, make this summarizing statement: 

II was noticed that if a conditioned reflex to a compound stimulus 
wm ^tabiished , . it was ^sy to maintain it in full strength and at 
the same time to convert its individual components, which gave a positive 
^eet when singly, into negative or inhibitory stimuli. This re- 

rft k obtained by constant reinforcement of the compound stimulus, 
wbie its components, on the frequent occasions when they are applied 
remain without reinforcement. This experiment can be made with 

^tor oi^tn. A a^egate may be defined as a group of stimulus 

aS of which iMuaiy begin and terminate at the same time, €.g., the 
irtimnte aMsrgl® givai o€ by an c^ject such as an apple. 



THE PATTERNING OF STIMULUS COMPOUNDS 351 

equal success in the reverse direction, making the stimulatory compound 
into a negative or inhibitory stimulus, while its components applied singly 
maintain their positive effect. (8, p. 144.) 

Fortunately there have recently become available some detailed 
examples of closely analogous forms of simultaneous patterning in 
dogs; these are from a study reported by Woodbury (10), In one 
experiment a dog named “Dick” was placed in a wooden stock 
much like that employed by Pavlov. Just in front of the dog's 
head was a light horizontal wooden bar; when this bar was raised 
about half an inch it closed an electrical circuit, activating an 
electromagnetic device which released into a pan before the dog 
a pellet of appetizing food. The animal was first taught to lift the 
bar with his nose to secure the food. Later a wooden shutter was 
placed before the bar to prevent the dog from nosing it except 
immediately after the stimuli to be patterned were given. These 
stimuli were produced by two buzzers, one with a low-pitched, 
rather raucous tone (represented by L), and the other with a defi- 
nitely higher-pitched and much less raucous tone (represented by 


TABLE 11 

Quantitative Statement of the Cotjese op the Patterning op a Simux«- 
TANEOTjs Stimulus Compound LsAHimD by Woodbury’s Dog, *‘Dick,” To- 
gether With the Steps in the Derivation op the Patterning Coefficient. 
(Derived from an unpublished table from Woodbury’s data, 10 .) 


Suce^ive 

Hundreds 

of 

Differential 

R^nforce- 

ments 

Per Cent 
Reaction 
Evocation 
by 50 
Presen- 
tations of 
CoDgound 

(Q) 

Per Cent 
Reaction 
Evocation 
by 25 
Pr^n- 
tations of 
Component 
H 

Per Cent 
Reaction 
Evocation 
by 25 
Plan- 
tations of 
Component 
Li 

Mean 
Per Cent 
Reaction 
Evocation 
by H and L 

{Q) 

Psttermng 

CoefiScien^ 

<2-^ 

(P) 

1 

91 

91 

92 

91.5 

-.5 

2 

100 

100 i 

100 

100.0 

0.0 

3 

100 

100 I 

100 

100.0 

0.0 

4 

100 

100 

100 

100.0 

0.0 

5 

100 

76 

100 

! 88.0 

12.0 

6 

100 

S 

92 

50.0 

50.0 

7 

100 

12 

48 

30.0 

70.0 

8 

100 

0 

40 

msL 

m,o 

9 

100 

0 

40 


mM 

10 

100 

0 

25 

12.5 

87.5 

11 

100 

0 

0 

.0 

100.0 

12 

100 

0 

4 

2.0 

^.0 

13 

100 

0 

0 

.0 

100.0 
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H). The apparatus was so set that when the buzzers were sounded 
simultaneously (HL) and the dog nosed the bar, the food would 
be delivered, but w^hen either buzzer was sounded alone {H or i) 
and the dog nosed the bar, no food would be delivered. 

According to Pavlov's summary statement quoted above, this 
differential reinforcement, as it is called, should differentiate the 
compound from the components. Table 11 and Figure 79 show 
that such a differentiation did in fact occur and that positive pat- 
terning resulted. The dog learned practically never to nose the 
bar when either H or L was presented alone but practically always 



Fig. T^. Graph showing in detail the course of the learning of Woodbury’s 
dog, ‘*Dick,” to react positively to a simultaneous stimulus compound con- 
sisting of a high-pitched buzzer (if) and a low-pitched buzzer (D, and not to 
react to the components, i.e., the buzzers presented separately. Of each one 
hundred trials, 50 were of the compound {HD, 25 were of the high-pitched 
component, and 25 were of the low-pitched component. This figure was 
plotted from the values shown in Table 11. (Adapted from Woodbury, 10.) 

to nose it at the presentation of the compound, EL; this process, 
however, was a protracted one requiring some 1300 trials. An 
mcamination of Figure 79 reveals, in addition to the fact of final 
differentiation or successful patterning, the following details of the 
learning process: 

1. AlthoughL before the introduction of differential training the a-n imal 
was r^|X)iidiiig ICX) per cent of the time to the stimulus of the bar fol- 
lowing the lifting of the shutter, ^>on after this training began both the 
compound and the components showed some tendency to extinction or 

extinction eff^ts, as indicated by the initial depr^ion of all 
tiirae curv^ 

2. As pimcti^ continued, all thr^ curves rc©e to 1(K) per cent, where 

for rame trials. 
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3. Following this recovery, each component stimulus gradually lost its 
power to evoke the reaction, the high-pitched buzzer distinctly less slowly 
than the other, 

4. The general shape of the falling curves approximates roughly that 
of the ogive; i.e., at first the fall is positively accelerated, after which the 
acceleration reverses itself and becomes negative, finally approaching the 
horizontal at a zero frequency of response. 

In full verification of Pavlovas assertion quoted above, Wood- 
bury reports the detailed record of the pattern learning of a second 
dog, “Chuck,” in which the component stimulus elements in the 
experimental arrangement just described were positive and the com- 



Fig. Graph showing in detail the course of the learning of Woodbury's 
dog, “Chuck,” to react positively to the components of a simultaneous audi- 
tory stimulus compound (H) and (D and to react negatively to the com- 
pound itself {HD, Out of each 100 presentations, 50 were HL, 25 were H, 
and 25 were L, 


pound was negative. The course of this bit of n^ative pattern 
learning is shown in some detail by the curves of Figure A 
comparison of this figure with Figure 79 reveals the following: 

1, After the first shock of the non-reinforcement associate with the 
difierential training, the positive components rose to 100 per cent and 
maintained that position substantially to the end of the traming, much 
as did the pc^itive compound in Figure 79. 

2. Upon the whole, the components (pc^tive) showed more of a 
tendency to fall below KK) per (^t in I%ure 80 than did the ccm|K>iaKl 
(pc^tive) in Figure 79. 

S. While the negative compound of Figure 80 began to its efer^ve 
excitatory pK>tential more promptly than the n^ative com|x>nents in 
F^re 79, the former showed a greater r^tan<^ to mmpiete estinction 
than the Latter, 
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PRINCIPLES UPON WHICH THE PATTERNING OP STIMULUS 
COMPOUNDS IS BASED 

While Pavlov was completely familiar with the various aspecte 
of the experimental phenomena of patterning, it would appear that 
he regarded patterning as an ultimate molar phenomenon in that 
he did not succeed in breaking it down into more elementary prin- 
ciples. This is a little surprising, since he was well acquaints 
with most, if not all, of the principles required for doing so, notably 
the principle of afferent neural interaction (see Chapter III, p. 
42). Because of the importance of patterning for behavior dy- 
namics, such a derivation will presently be given. 

The reader is now familiar with the molar principles upon 
which the theoretical derivation of the patterning of stimulus com- 
pounds is based. Tor convenient reference they are summarized 
as follows: 

a. Habits are connections between receptor -•discharges and effector 
discharges, as shown by the following diagram (p. Ill), 

S- >$ — *»r- ►B, 

in which the arrow with the broken shaft between s and r represents 
the habit, or H of the symbol (Postulate 4, p. 178 ff.). 

b. Afferent impulses (a) interact, changing all impulses involved to a 
greater or le^ degree, while on their way to that point in the nervous 
system at which the reinforcement process makes the junction between 
the a and the r; the expression s represents the s as changed by inter- 
action with another s (Postulate 2, p. 47 ff.). 

c. When s has become a, this represents a change in position on a 
generalization continuum (p. 188) . Thus, s would tend to evoke a reaction 
conditioned to though because of the generalization gradient the habit 
strei^th, and so the reaction potential would be weaker than 
|Tjb (Pcstulate 5). 

d. In the ea^ of complete experimental extinction, the total inhibition 
fe a|uai to tl^ difference between the reaction potential and the reaction 
thn^idid, (p. 3^ff.), i.e., 

e. The gradieite of g^ueralization of both excitatory potential, gEs, 
inhibitory jxjtential, have approximately the same dope; ie., 

have the ^me exponent (p. 275 ff.). 

Otter tiie magnitude of the interaction effects tetweei 

mpiis^ from a nsjeptor field, eg., the retina, will be 

pmte ttet betweoi impulses which arise from distinct receptor 
eg., tie return aaui the tactile r^eptors. 
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g, other things equal, the magnitude of the interaction effect of one 
afferent impulse upon a second is an increasing monotonic function of the 
magnitude of the first (Postulate 2, p. 47 ff.). 

h. When a stimulus energy ceases to act on a receptor, the condition- 
able afferent impulses set in motion by the stimulation continue to be ac- 
tive in the central neural substance for some seconds; the intensity of 
this perseverative tendency gradually diminishes as a negative growth 
function of the time since the termination of the action of the stimulus 
on the receptor (Postulate 1, p. 47). 

i The empirical degree of stimulus patterning (P) where the reaction 
m of the ali-or-none type (such as that displayed by Woodbury’s dogs) 
may be represented sufficiently well for our present expository pur- 
pose by the formula, 

P-Q-a (52) 

where Q represents the mean empirical per cent of reaction evocations by 
the reinforced phase of the compound, and Q represents the mean per cent 
of reaction evocations by the negative or non-reinforced phase of the 
compound. 

j. The theoretical degree of stimulus patterning (P') may be con- 
veniently represented for our present purposes by the formula, 

F = (53) 

in which O' is the effective reaction potential of the negative portion of 
the discrimination, and is the effective reaction potentiaL of the posi- 
tive portion of the discrimination. 

Both the P and the P' formulas yield a value of 100 for perfect 
or complete patterning, and one of zero where no patterning ten- 
dency exists. In case patterning is negative, both formulas yield 
a negative value. Several examples of the use of the second, or P' 
formula, will be encountered in the immediately following pag^. 
An extended example of the use of the empirical or P formula is 
contained in Table 11. Substituting appropriatefy in this formula 
from the first row of entri^, we have, 

P = 91 - 91.5 

= - A 

which yields a slight and presumably alypieal native v^due; ibis 
value constitutes the first entry in ihe last column. The pa^mi- 
ing index or coefficient, P, reveals effa^ively ihe eoui^ of the 
acquisition of patterning as a unifi^ proce^ This is brou^t 
out strikingly by the graphic represaatation of the P-valnes m a 
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I 2 3 4 5 6 7 8 9 10 II 12 13 

SUCCESSIVE HUNDREDS OF DIFFERENTIAL REINFORCEMENTS 

Fig. 81. Graphic representation of the progress of the positive patterning 
of a simultaneous stimulus compound learned by Woodbury's dog, “Dick” 
Plotted from the final column of Table 11. Note the approximately ogival 
form of this curve. (See Chapter XIII, p. 204 ff.) 

function of the number of differential reinforcements, which may 
be seen in Figure 81. 


TianiKiREricAL debivatton op the spontaneotjs or quasi- 
patterning OF simxjdtaneous stimulus compounds 

Prtxjeeding to the application of the above principles to specific 
valu^, we shall first make a theoretical analysis of the tendency 
to spontaneous or differentially imleamed patterning exhibited by 
a simple conditioned simultaneous stimulus compound in which the 
componente are potentially negative, i.e., about to be unreinforced. 
In Uiis way will he rounded out a discussion begun above (Chap- 
ter XIII, p. 2)4) concerning the influence of afferent interaction 
cm the dynamite of stimulus compounds (p. 217 ff.) . 

Let it be aMon^ toat by the Pavlovian technique a simultane- 
mm ^miilus compound of two components (Sx and 82 ) is condi- 
to a readacm (ij) stron^y ^ou^ for St to command a 
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habit strength of 40 habs and Sg to command a Jiabit strength of 
60 habs. Taking the value of the effective drive, D (Chapter XIV, 
p, 245), at unity we have, 

= 40 wats 


and 


=60 wats. 


Xow, by the physiological summation of these two excitatory poten- 
tials, we have, 


O' = jr, + jr^E = 40 + 60 - 


= 100-24 
/. O' = 76 wats. 


We next consider the excitatory potential of the two compo- 
nents if each is acting alone, i.e., independently. Since the respec- 
tive afferent impulses will not be interacting upon the separate 
presentations of & and Sg, the afferent impulses resulting from their 
individual action on the receptors will be represented simply by 
Si and Sg, i.e., without the breve. Now (Postulate 6), the shift from 
the Si of the compound to the Si of separate action will involve a 
generalization effect and therefore a fall in excitatory potential from 
tjEs to s^Ejs, Let it be assumed that throughout this particular 
problem, unless otherwise stated, the reduction from I to s, or vice 
versa, amounts to 25 per cent. Accordingly, we have as the sepa- 
rate action of the respective components, 


sJEr = 40 — 10 = 30 wats 
= 60 — 16 = 45 wats; 

and, by calculated physiological summation, these two potentially 
negative stimulus components would exert the equivalent of, 

= 75 - 13.5 
A ^ = 61.5 wats. 
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Next, substituting in the formula for calculating the theoretical 
patterning index (P'), we have, 

^-™0-§ro) 

= 100 (1 - .809) 

= 100 X .191 
= 19.1. 

This means that at the very outset of the patterning procesSj m 
case the components are negative, afferent interaction produces a 
small amount of spontaneous positive patterning. 

We turn next to the theoretical analysis of the spontaneous 
patterning of a simultaneous stimulus compound in which the com- 
ponents are separately reinforced and the compound is potentially 
negative. In that event, making assumptions analogous to these 
of the preceding case, we have, 

sJEr = 40 wats, 

and 

tJEs = 60 wats, 

which if summated physiologically would yield, 

O' = = 76. 


Similarly, by the process of generalization from $ to s, each ex- 
citatory potential in passing from the separate state to that of 
the compound would reduce the reaction potentials from 40 to ^ 
wate, and from 60 to 45 wats respectively, which by physiolo^cal 
summation would yield, 

^ = 61.5, 

ju^ as tefore. 

Substituting th^ Q- values in the formula for patterning, we 
have, 

P' - 19.1, 

exactly as whm the compound was reinforced. These conadera- 
tions l^d us to our first corollary: 

I. FoUmmm f Aa application of positive reinforcement hut pre^' 
vums to the ap^icatum of unreinf&rcement, situations involmng 
stim'ulw (^m^tmds will show a certedn tendency to 
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the strength of the tmdency hem the same whether 
pattern y, components are reinforced. 

^ wTStb now to the consideration of the influence on the amo^t 
We of increasing the amount of neural mter- 

of sponteneou p tte g interaction, the 

generalisation from a to a and ^ce 
^ T et it be supposed that this reduction is mcreased from 
fhr^S per cent assumed above, to 50 per cent. In that case, on 
^u^pLns otherwise the same as above, if the compound is posi- 
tive we have after generalization, 

^ Eb = 40 - 20 = 20 , 


= 60 - 30 


: 30. 


By physiological summation, assuming the components to be nega- 
tive, we have, ^ 

Q = 20 + 30 
= 50-6 
= 44. 

Since, exactly as when the generalization reduction was 25 per cent, 

Q = n+SjEa f= 76, 

we have, by substituting in the patterning formula, 

F = 100(l-f|) 

= 100 (1 - .578) 

= + 43.2. 

This value of +42.2, when the g--alization 
neural interaction is 50 per cent, is to be compar ^ 

P-value of 19.1 when the generalization f 

Since, by Corollary I, the spontaneous tendency ^^^!^^2ary to 

same wLn the components are positive, it 

derive that case for the decreased our sec- 

Generalizing from the above calculation, we amve at our sec- 

ond^oMter^ aferent neurd 

yr^ter l XTthe amount of ^‘^ntaneou^’ pattenmg, both pon 

Uve and negative. 



360 


PRINCIPLES OF BEHAVIOR 


TBGEORETICAL DERIVATION OF THE POSITIVE PATTERNING OP 
SIMULTANEOUS STIMULUS COMPOUNDS BY 
DIFFERENTIAL REINFORCEMENT 

The theoretical derivation of quasi or spontaneous patterning 
has much in common with that of the genuine patterning of stim- 
ulus compounds which is attained by means of differential rein- 
forcement. Accordingly the above account of the former will serve 
as a useful introduction to the slightly more complicated derivation 
of the latter now to be given. The two processes have in common 
a dependence upon afferent interaction with the consequent opera- 
tion of the generalization gradient, physiological summation, etc. 
Genuine patterning involves the additional complication of the gen- 
eration of experimental extinction {Ir) which necessarily results 
from the non-reinforcement that is a part of differential reinforce- 
ment.^ There is also involved the associated generation of condi- 
tioned inhibition (sIr) and the generalization of a- portion of this 
back upon the positive or reinforced phase of the patterning situa- 
tion. 

We shall take as our first example the case in which the com- 
pound is positive (reinforced). In order to make the exposition 
as intelligible as possible, we shall assume, as in the above exam- 
ples, that we have a compound of only two stimulus components, 
Si and S 2 , and that the process begins with the initial conditioning 
of the compound to the reaction, R; this yields, as before, excitatory 
potentials as follows: 

'iJEs = 40 wats 
= 60 wats, 

which, by physiological summation, yields a \^s^r of 76 wats 
(p. ^3). Now, by generalization, %jEr shrinks 25 per cent from 
a value of 40 to one of 30 and shrinks from a value of 60 to 
one of 45, i.e., 

= 30 wats 
and 

t^R = 45 wats. 

A^uming a reaction threshold of 10 wats as a minimum neces- 
sary to evoke reaction (see p. 323 ff.), the non-reinforcement of the 

^ TEe gei^ratioai of inhibitioii during reinforcement is here neglected in 

inters of expodtory amplicity. 
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separate components and to the reaction threshold (10 
watsi would reduce each to a value of lero respomesg with the 
generation of inhibition amounting to, 


and 


30 — 10 = 20 pavs 
45 — 10 = 35 pavs. 


Assuming that half of each of these values is made up of condi- 
tioned inhibition and therefore is subject to stimulus gen- 

eralization (p. 281 ff.), and recalling that^ by assumption, 75 per 
cent stimulus generalization occurs between s and or vice versa, 
we have as the amount of inhibitory potential generalizing from 
the r^pective non-reinforcements back upon the corresponding 
reinforcement phase, 


^ X .75 7.5 pavs 

^ X .75 - 13.13 pavs. 


Summating these inhibitory potentials physiolo^cally, we have, 


= 7.5 + 13.13 - 


7.5 X 23.13 
lOQ 


= 20.63 - .99 
— 19.64 pavs. 

From this it follows from the formula for sEb tliat. 


i.e., 


== 76 - 19.64 
= 56.36, 

O' - 56.36 wats. 


On the other hand, since the two threshold valu^ left by tie 
tion process never operate together, they will be average, rather 
than summated, i.e., 


10 wate. 
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Substituting these Q-values in the fonnula for patterning, we have, 

= 100 (1 - .177) 

= 82.3, 

which represents a large degree of patterning. 

Suppose, on the other hand, that we employ the empirically 
more realistic response formula for observed patterning, 

P = Q - Q. 

The per cent of responses to equals 10 wats (after extinction 
to zero) , and equals the same. But, 

sLb = 10 , 

and when 

= sPs, 

p = 0, 

i.e., 

Q = 0. 

Xext, if we assume that the maximum range of oscillation (sOb) 
is 40, since, 

O' = 56 wats 

and the threshold is taken as 10 wats, it follows that oscillation 
will frequently carry the reaction potential below the reaction 
threshold (sLj?),i.e., 

56 - 10 > 40, 

from which it follows that the will yield 100 per cent reac- 

tion evocations, and the value of Q will be 100 per cent. Substi- 
tuting these values in the formula for P, we have, 

P^Q-Q 
= 100 - 0.00 
= 100 ; 

ie., empirical pattening (P) will he perfect. Thus we arrive at 
mac tMrd eoroEary: 
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III. In case a simtdtamom atimidm compound and its com- 
ponents receive dilferential reinforcement, the components bemg 
negative {unreinforcedi, sufficient training will produce complete 
empirical patterning provided the differences between s and s are 
great enough to bring about a residual which exceeds the 

range of behavioral oscillation (gO^) plus the reaction threshold 
[sLe) • 


THEOEETICAIi DERIVATION OF THE NEGATIVE PATTERNING OF 
SIMULTANEOUS STIMULUS COMPOUNDS BY 
DIFFERENTIAL REINFORCEMENT 


We turn next to the case of the patterning of a simultaneous 
stimulus compound in which all the conditions are assumed to be 
the same as those in the preceding example, except that the com- 
pound is negative, i.e., Si and Ss are reinforced separately until 


and 


— 40 wats 


= 00 wats. 


Then Si and S^ are presented as a compound and given differential 
reinforcement. By reasoning exactly analogous to that of the pre- 
ceding example, s will generalize to the 5 of the compound to the 
extent of 75 per cent, yielding, when in the compound at the out- 
set of differential reinforcement, 


= 30 wats 


and 


't^B = 45 wats. 


Since these are extinguished jointly in each will contribute 

roughly equal amounts to the 10-wat threshold, i.e., Q' = 10. This 
will leave to be extinguished by the negative aspect of differential 
reinforcement, and so to be converted into inhibition, 


30 — 5 = 25 pavB 


and 


45 — 5 = 40 pavB. 
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If, now, half of this inhibition in each case is bIr and 75 per 
cent of Jr generalizes, we have, 

- 2 ^ 

'sJb ~ '2 ^ ~ PaVS, 

^ X .75 = 15.00 pavs 


to generalize back on sjEb and respectively. It follows that. 


and 


,jEs = 40 — 9.38 = 30.62 wats 
= 60 — 15 = 45.00 wats. 


Since the two negative potentials are generated together, they are 
already sununated physiologically at the reaction threshold, i.e., 


O' = 10 wats. 


On the other hand, since sjEjs and s^Eb never operate together, they 
will be averaged: 

_ 30.62 + 45.00 
^ 2 


O' == 37.81 wats. 


Substituting these Q-values in the patterning formula, we have, 

= 100 (1 - .264) 

= 100 (.736) 

= 73.6, 

which reiH^^aits a considerable patterning tendency. 

Suppose, now, we consider the response aspects of this situa- 
tion. It may be seen that, assuming the range of behavioral oscil- 
lation (sOb) to be 40 wats, since, as stated above. 


and since 
md 


sEb = 45 wats, 


Jjb = 10 wate, 


45 - 10 < 40, 
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Sf will not evoke R KX) per cent of its presentations but, by Table 
9, would yield about 96 per cent of reaction evocations.^ Also 
since 


and since 


= 30.62 wats, 
sLs == 10 wats, 


32.62 ~ 10 < 40, 


it follows that St will evoke R at less than 100 per cent of its 
presentations, i.e., at approximately 61 per cent of the stimulations.. 
Thus, 




96 + 61 
2 


78.5. 


Accordingly, substituting in the empirical patterning formula we 
have, 

P = 78.5 - 0.00 


= 78.5, 

a degree of response patterning appreciably less than perfect and 
so definitely" less than the degree of response patterning found 
under comparable conditions where the compound was positive. It 
is evident, however, that if and were sufficiently rein- 
forced to bring each of them up to a value of about 70 habs, both 
sAb would mediate the evocation of R on 100 per cent 
of the trials, in which case the empirical patterning index would 
be 100, i.e., perfect even under the conditions of component rein- 
forcement. This brings us to the statement of our fourth corollary: 

IV. In case a simultaneous stimulus compound and its compch 
nents receive differential reinforcement, the compound being unrein- 
forced, sufficient training will produce complete empirical patterning 
provided the differences between s and s are great enough to produce 
a residual s^b which exceeds the range of behavioral oscillation 
(sOe) ; but, other things equal, this type of learning will require a 
greater number of differential reinforcements to attain a given de- 
gree of patterning than will that in which the components are unre- 
inforced. 

1 The statement of the method of determining this value and the reasons 
for so doing were indicated above, p. It may be said here, however, 

that the major assumption underlying the procedure is that the value of 
bEx oscillates below its expressed potential magnitude and that the calcu- 
lation involved the use of the probability function of behavioral oscillation 
^own in Table 9 (p. 311). 
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The tendency of empirical patterning to break down through 
the failure of the components to evoke the reaction at 100 per cent 
of the stimulations is nicely illustrated in Figure 80, where, at the 
twelfth hundred differential reinforcements, we have, 




96 + 100 

o 


98 per cent. 


At the same time, 

Q = 8 per cent, 

which yields as an index of patterning, 
p = 98 - 8 = 90, 

a value distinctly less than perfection. 


THE POSITIVE PATTEBN'IHG OF SIMHLTANEOITS STTMULUS 
COMPOUNDS UNDER CONDITIONS OF 
DECREASED GENERALIZATION 


We now come to the final step in our analysis of the process 
of stimulus-patterning learning by differential reinforcement. Let 
us consider the influence on this process of an increase in the affer- 
ent interaction effects suflBcient to steepen the fall in the general- 
ization gradient from 25 per cent to 50 per cent. In order to sim- 
plify ihe exposition we shall take the original case of a simultaneous 
stimulus compound in which the compound is first reinforced so 
that, 

= 40 wats 
and 


= 69 wats. 


Now, by generalization, with a loss of 50 per cent, it follows that, 


= 20 wats 
and 

= 30 wats. 

The ^parate extinction of each of these to the reaction threshold 
of 10 wats would generate inhibition in the amount of 

20—10 10 pavs 

and 

30—10 20 pavs. 
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Assuming that half of each of these generalizes to the extent of 
50 per cent, we have, 


and 


= -^ X .50 = 2.5 iHivs 
- 20 

♦yfs = -^ X .50 = 5.0 pavs. 


Summating these inhibition values, 

+ = 2.5 + 5.0 - 

= 7.5 - .01 
= 7.49 pavs. 


2.5 X 5.0 
100 


It follows that 


i.e.. 


= 76 - 7.49 
= 68.51, 

Q! = 68.51 wats. 


As in the example invoMng 25 per cent generalization, the thresh- 
old reaction potentials remaining in gfis and are 10 wats 
each. 


Q' 


10 4-10 
O 


= 10 wats. 


Substituting in the following formula, we have, 

P' =m(l gggj) 

= 100 (1 - .146) 

= 100 X .854 
= 85.4. 


This value, it will be noted, is larger than the 82.3 yielded by the 
smaller fall in the generalization gradient. 

However, just as in the case of the 25 per cent generalization 
reduction, both components, despite the threshold values of 10 pavs, 
will yield 0.00 per cent of reaction evocation, i.e.. 




0.00 + 0.00 


= 0 . 00 . 


o 
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On the other hand, since 

68.51 - 10 > 40, 

the compound will evoke R on 100 per cent of the presentations, 
i.e., 

Q = 100. 

Accordingly, 

P = 100 - 0.00 
= 100 ; 

i.e., empirical patterning will be perfect. This brings us to our 
fifth corollary: 

V. In the 'patterning of stimid'us compounds, the greater the fall 
in the generalization gradient het'ween s and s, the less will be the 
difficult]/ in attaining a given degree of discrimination, 

SOME EXPEEUMEKTAL EXAMPLES OF THE LEARNING OF 
TEMPORAL STIMULUS PATTERNS 

We turn now to the problem of the patterning of temporal 
sequences of stimuli, i.e,, the learning of temporal stimulus pat- 
terns. Pavlov, in whose laboratory this type of learning seems first 
to have been studied, describes several such experiments. In an 
experiment performed by Dr. Eurman, a dog was presented with 
a sequence of three stimuli: a light (L), a cutaneous stimulus (C), 
and the sound of bubbling water (S), When these stimuli were 
given in the order LCS, the presentation was always followed by 
focxi reinforcement, but when they were given in the reverse order, 
i.e., SCL, the combination was never reinforced. The dog finally 
reached a point of training such that it secreted an average of 
alK)ut 8 drops of saliva to LCS, and zero drops to SCL, which pre- 
sumably indicated perfect patterning. Unfortunately, Pavlov does 
not report the course of the learning of this or any other pattern, 
though he makes a general statement that such learning often re- 
quires protracted training and when achieved is very unstable {8, 
p. 147). 

Fortunately we have in the study by Woodbury (10) already 
referred to, a detailed report of the course of both the positive 
smd the negative forms of temporal patterning. The same appa- 
ratus arrangement and general procedure were employed as de- 
alxjve for the patterning of simultaneous conditioned com- 
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poiindSj but '^uth this difference: the high-pitched buzzer was 
sounded for one second; then, after a pause of one second, the 
low-pitehed buzzer was sounded for one second, following w^hieh 
the shutter was raised, giving the dog access to the bar. If the 
dog nosed the bar, the act was reinforced by food. On the negative 
side, the high-pitched buzzer was presented for one second twice 
in succession, with a pause of one second betw^een presentations. 
On other occasions the low-pitched buzzer w’ould be presented twice 
in the same w^ay. After each of these presentations the shutter 
was lifted, but if the dog nosed the bar no food w’ould be given. 
The learning behavior of the dog ^Ted,” by this procedure, is shown 
in detail by Figure 82. A study of this figure indicates in general 



CROUPS Of 100 TRIALS 

Fig. 82. Graph showing in detail the course of the learning of Woodbury’s 
dog, ^‘Ted,” to react positively to a temporal stimulus compound of a high- 
pitched buzzer followed by a low-pitched buzzer {HJj), and negatively to a 
parallel presentation of the high-pitched buzzer and of the low- 

pitched buzzer (LJj). Out of each 100 presentations, 50 were 25 were 
HJI, and 25 were LJb. (Adapted from Woodbury, 10.) 

a striking agreement wdth the course of the patterning of simul- 
taneous stimulus compounds, except that the amount of training 
required to complete the process of temporal patterning was notice- 
ably greater. There is the same initial depression of all three 
curves at the outset of differential reinforcement, and the same 
subsequent rise of all three to 100 per cent; following this the two 
non-reinforced component combinations gradually fall toward zero, 
taking a roughly ogival course. 

The exact reverse of the experiment just described was cani^ 
out by Woodbury with the dog “Bengt”; i.e., the temporal m- 
quences HyH and L,L were reinforced, but the i^uence HJj was 
never reinforced. The course of this learning may be seen in 
Figure 83, a study of which reveals the same general feature as 
those of Figure 80, though the difficulty of learning is somewhat 
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greater. It may also be noted that Figures^ 80 and 83 both show 
a greater amount of disturbance (weakening) of the reinforced 
phases than do Figures 79 and 82; this increased disturbance pre- 
sumably comes from the generalization of the greater amount of 
inhibition arising from the extinction of the non-reinforeed com- 
pound upon the relatively weak component. 

It is to be noted that even though the values represented in 
the preceding figures are derived from the pooling of a very large 
number of observations, the performance of a single animal is not 
sufficient basis for the establishment of empirical laws, though it 



Fig. 83. Graph, showing in detail the course of learning of Woodbury’s 
dc^, **Bengt,” to react positively to the temporal combinations Hfi and LJj, 
and to react negatively to the combination HJj. Out of each 100 presenta- 
tions, 100 were 25 were HJB., and 25 were LjL. (Adapted from 
Woodbury, 10.) 

may serve to illustaute theoretical principles; it is as such that 
Woodbury’s graphs are offered here for consideration. However, 
the performances of ihese single animals give complete and suffi- 
cient proof of one thing: they demonstrate that the positive and 
the negative forms of both types of stimulus patterning Cdn be 
learned by dc^. 

A akabtsis of temporal stimulus patternin'G 

The iheoretical analysis of temporal stimulus patterning is at 
bottom about the same as that of simultaneous stimulus pattern- 
ing. Hiis is to say that temporal stimulus patterning depends 
upon substantially the same principles: afferent neural interaction, 
the generalization of excitation, the extinction of the generalized 
excitation, and the final ^neralization of this inhibition back upon 
tiie iKJsitive reaction potential. There is, however, this important 
diffemice: in temporal stimulus patterning the neural interaction 
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presumably takes place between the afferent impulses arising di- 
rectly irom the second stimulation and the p6rs6ver(itiv6 sliTuuius 
traces (Postulate 1) which were originally set in motion by the 
earlier stimulation. Thus the neural interaction of the componenlB 
of temporal stimulus patterns is a simultaneous affair exactly as 
is that of the components of simultaneous patterning; the difference 
lies in the fact of the temporal as:^Tichronism of the action of the 
respective stimulus energies which originally set the interacting 
impulses in motion. From the point of view of adaptation and 
survival this difference is of enormous importance, but the pattern- 
ing mechanism itself differs little except quantitatively. 

It is evident from the foregoing that having postulated per- 
severative stimulus traces (Postulate 1), the patterning of temporal 
stimulus compounds follows at- once, by reasoning exactly analogous 
to that by which Corollary III was derived. This brings us to the 
statement of our sixth corollary: 

VI. In case {1) a temporal stimulus compound and (£) a repeti- 
tion of either component in the same tempo as that of the presenta- 
tions of the compound, receive differential reinforcement, the repeti- 
tion of the component being unreinforced, sufficient training will 
produce complete stimulus patterning provided the difference be- 
tween s cmd s is great enough to bring aboul a residual 
which exceeds the reaction threshold by an amount greater than the 
range of behavioral oscillation (sOjr). 

Having derived the basic phenomenon of temporal stimulus 
patterning, we proceed to the examination of certain quantitative 
differences which may, theoretically, be expected to manifest them- 
selves in the comparison of the patterning of simultaneous and 
successive stimulus compounds. The most striking of these dif- 
ferences is evidently due to the progre^ive diminution in the inten- 
sity of the stimulus trace with the passage of time, as stated in 
Principle h (Postulate 1) ; i.e., other things equal, the intensity of 
a stimulus trace will be weaker than the original afferent impulse 
set in motion by the action of the stimulus energy (jS) on the 
receptor. Also, by Principle g (Postulate 1), a weak afferent im- 
pulse (sx) will produce a smaller interaction eSeoi on a second 
afferent impulse (s^) than will a strong value of Si, But the 
smaller the difference between s and I, the 1^ will be the fall in 
the generalization gradient; and, by Corollary V, the smaller the 
fall in the generalization gradient betw^n s and s, the greater will 
be the difficulty in the attainment of a given d^jree of pattern dis- 
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crimination. From these considerations there follow our seventh 
and eighth corollaries: 

VII. Other things equal, a given degree of the patterning of 
temporal stimulus compounds will require more differential rein- 
forcements than will that of simultaneous stimulus compounds, 

VIII. Other things equal, in the patterning of a temporal stim- 
ulus compound, the greater the lapse of time between the termina- 
tion of one stimulus and the beginning of the next, the greater 
will be the number of differential reinforcements required to attain 
a given degree of patterning. 


THE RESOLUTION OF HUMPHREY'S ARPEGGIO PARADOX 

A good deal of misunderstanding has arisen in learning theory 
regarding the role performed by the stimulus, presumably because 
of the hidden nature of neural interaction effects. This may be 
illustrated by an experiment reported by Humphrey {6, pp. 198, 
237); 

Subjects were trained . . * to raise their hand at the sound of a certain 
ai)ecific tone. The training was accomplished by administering an electric 
shock when the tone was sounded, but never when any other tone was 
sounded. Suppose that the active tone was G above middle C. We have 
then a conditioned reflex to this tone, which has been differentiated out so 
that no other tone producible on the apparatus was followed by response. 
By prolonged practice this conditioned reflex became very highly stabil- 
ized. Suppose now that the active note is included in a melody or an 
arpe^o such as that formed by the successive notes C, E, G, C, where 
G is the active note, [The instrument was of the xylophone type with 
mel^ cylinders, which were not damped after a note was struck, conse- 
quently the vibrations from one note persisted during the striking of the 
note, as in l^ato piano plaj^. (p. 198.)] The arpeggio then con- 
tain the stimulus for the conditioned response. The records show that 
while the i^lated note was consistently followed by response, the same 
note, rej^ted inunediatd.y in an arp^gio, was consistently not followed 
by r^ijK>n^. The mdody ‘Home Sweet Home^^ when played in the key 
of C contains the note G fourteen times. Experiments showed that sub- 
lets trained to diis note consistently respond to it when presented in 
Elation and do not respond to it when presented in the melody. This 
is very striking in view of the fact of the fourteen repetitions of the 
active note included in the melody as played, (p. 237.) 

How diall th^e experimental observations be interpreted? The 
atove aommxt seems to show that when the note G was struck, 
tihe prec^ng not^ C and E were also still vibrating, which would 
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produce a simultaneous stimulus compound. In the case of the 
melody, “Home Sweet Home,” presumably sometimes there would 
be a simultaneous stimulus compound of the “active” note and 
the two or three preceding notes combined with a temporal com- 
pound made up of the impulses from these active stimuli and the 
•perseverative stimulus traces arising from the stimulations of the 
more remotely preceding notes. Since, according to the preceding 
analysis, afferent impulses and perseverative traces operate in much 
the same way, an analysis of the simpler arpeggio situation will 
sufBce for both. 

According to the neural interaction hypothesis, the afferent 
impulse, sc, would be changed to the impulse Sc when conjoined 
with the impulses arising from Sc and Sb- This alone might easily 
weaken the reaction potential sqBr suf&ciently to produce external 
inhibition. However, it must be recalled that, by differential rein- 
forcement, there had presumably been developed a considerable 
amount of conditioned inhibition, s^Jb and BrIb- Tet it be sup- 
posed, for example, that b^^b has a strength of 50 wats and that 
bJb and sJb have strengths of 30 pavs each. Because of the pro- 
gressive damping by the air, the intensity of Sc and of Sq will be 
reduced; this should be especially true of Sc, since it was struck 
first. Therefore, both Bc^s and bJb will be weakened appreciably, 
sJb to 20 pavs, say, and srIr to 25 pavs. Now, as the result of 
afferent interaction and the consequent fall in the generalization 
gradient, both the excitatory and the inhibitory tendencies alike 
will suffer a certain reduction in strength when they enter the com- 
poimd, say 20 per cent. This leaves us with the following values: 

'iJEls = 40 wats 
•ijR = 16 pavs 
'ijs = 20 pavs. 


Summating the two inhibitions, we have, 


5,+^^^ = 16-f20- 


16 X 20 
100 


= 36 - 3.2 


= 32.8. 
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It therefore follows that the effective reaction potentiality to the 
lifting of the hand at the striking of the note G in the midst of 
the arpeggio will be, 

sJ^R 40 — 32.8 
: 7.2. 

If we assume, as in the preceding computations (p. 364) , that the 
reaction threshold has a value of 10 wats, it appears that under 
the given conditions the net effective reaction potential available 
for reaction evocation in the arpeggio (7.2 wats) would be below 
the reaction threshold and therefore the hand would not be lifted 
during the playing of the arpeggio, exactly as Humphrey found. 
Thus Humphrey’s auditory configurational problem finds a natural 
and consistent explanation in terms of habit dynamics.^ 

SOME GENERAL CX)NSII>ERATIONS CONCERNING THE FUNCTIONAL 
DYNAMICS OF STIMULUS PATTERNS 

From the point of view of causation, an organism and its entire 
environment must be regarded as a complex causal interacting 
unit (i). Probably in all adaptive situations the act of the organ- 
ian does not peld an effect which produces a particular type and 
amount of reinforcement xmtil every one of a number of different 
conditions is satisfied. In an ideal adaptive situation the organism 
would have (a) receptors which would respond differentially to the 
impact of energies characteristic of each of the several critical 
conditions, and (b) receptors responsive to each of the various 
critical conditions which would 'prevent the act from resulting in 
reinforcan^t; also (c) the stimulus energies associated with these 
i^veral conditions should actually impinge on the relevant receptor, 
e.g., they should not be shielded from the receptor by the inter- 
IKBition of ^me other object. Under these ideal conditions, pat- 
terning would seem to be the only effective form of habit structure. 
Unfortunately, even though for the most part conditions a and 6 
obtain with higher organisms, condition c is often satisfied not only 
imi^rfectly but to varying and fortuitous degrees of imperfection. 

^Ifc may be added that the context of the quotation from Humphrey 
above eeen]^ to indicate that no small portion of the confusion in this 
exmm frcan tee piirely ^mantic difSculty of having failed to recog- 
nhi^ and tee dMinction faetw^n the stimidus energy (S) and tee 

alarmt hnpulae 'Ca). 
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For this reason the question of whether a reaction in a given situa- 
tion will be followed by reinforcement is almost always more or 
less of a gamble for the organism, even in the most advanced stages 
of training. It thus comes about, as Brunswik (/) has pointed out, 
that as the number of critical stimulus cues increase, the proba- 
bility becomes greater that the total group or configuration of 
causal factors necessary for the act to e%^entuate in reinforcement 
is present in fact. Therefore, as noted in connection with the habit 
draamics of stimulus compounds, the summation of habit strengths 
which are based on the afferent impulses of the stimulus compo- 
nents, Sj as contrasted with the. interaction effects represented by 
$j is a 'primitive biological first-approximation to a calcvlm of adap- 
tive probability. This mechanism, coupled with that of the reac- 
tion threshold isLjt), prevents the organism from vrasting its energy 
by reacting when the probability of need reduction is too slight. 
Similarly, if the number of times that a given stimulus element or 
aggregate has been associated with reinforcement is small, the 
habit strength will be small, the probability of response evocation 
will be small, and so here again the organism will tend to react 
automatically to the probabilities of the situation. The implica- 
tions of the very numerous permutations of these and related 
factors cannot be entered into here, though many of them are fairly 
obvious. 

But in case a situation is sufliciently stabilized for the critical 
variable factors to satisfy condition c, as was the case with Wood- 
bury^s dogs, experiments show, on the basis of recognized prin- 
ciples, that the organism can largely transcend the initial crude 
summation calculus of adaptive probabilities by reacting, or not, 
with precision to particular combinations or configurations of stim- 
ulus aggregates. It is true that a reaction associated with a par- 
ticular pattern discrimination developed in one static setting may 
not be followed by reinforcement in another, so iiat the organism 
must continue to gamble to some extent as long as life lasts. How- 
ever, each new reaction brings with it increased training in dis- 
crimination, which makes the odds fall progressively more in the 
animaFs favor as life goes on. 

One favoring subsidiary factor is that by compotmd trial and 
error organisms learn to adjust their receptors in such a way as 
to expose them more adequately to all the environmeutal ener^^ 
relevant to a given need; ttiis is liie behavioral mechanism which 
brings about searching or exploration. Another favoring bAavioral 
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mechanism, at least in human organisms, is the conditioning hy 
social trial and error of characteristic symbolic acts such as words, 
to certain stable and significant stimulus aggregates (objects) ; this 
presumably facilitates very greatly the indirect generalization (p. 
191) of instrumentally adaptive reactions set up in one configura- 
tional situation which enables them to function in other situations 
adaptively similar but differing to a considerable degree in stimulus 
configurational characteristics. Probably this explains w^hy Razran 
{9} found with verbally sophisticated human subjects that con- 
figurational generalization was considerably wider than was simple 
stimulus generalization. Feeble-minded individuals and dogs might 
be expected to show rather different results. 

On the basis of the above considerations the conjecture is haz- 
arded that one of the more important capacities in the higher levels 
of intelligence is that of discriminating afferent interaction effects. 
It is believed that in human beings this will be found intimately 
connected with the transfer by indirect generalization of the reac- 
tions from one stimulus pattern to another through the mediation 
of words. At very high levels of adaptive ejSSciency it is expected 
that words will constitute the mediating stimuli of the stimulus 
patterns themselves. 


SUMMARY 

Many life situations require for optimal chance of survival 
that the organism shall react to certain combinations of conditions 
(stimulus compounds) differently than to the component conditions, 
either w^hen these components are encountered ^^separately” or in 
other combinations. The most radical, and at the same time the 
most simple, formulation of this problem is presented by Pavlov’s 
experimental arrangement for the discrimination of a stimulus 
compound from its components. Experiments have fully demon- 
strated that organisms over a wide phylogenetic range are able to 
learn such discriminations, though usually with comparative diffi- 
culty. 

This type of learning by organisms turns out upon analysis 
definitely to be a derived or secondary phenomenon, dependent 
upon a number of logically prior principles all of which have been 
r^ognized by Pavlov, Among the more important of these are the 
of the reaction (R) to the afferent impulse (s) set in 
motion by the stimulus (S), rather than directly to the stimulus; 
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the mutual interaction of afferent impulses; and the downward 
slope of the gradient of generalization both for excitation and for 
inhibition. For the derivation of temporal stimulus patterning 
there is required, in addition, the principle of the perseverative 
stimulus trace. 

When stripped of quantitative details, the basic logic of stimulus 
patterning is rather simple. The afferent impulses produced by 
the components of a dynamic stimulus compound are to some extent 
different when the component is acting '"alone,” i.e., in a relatively 
static combination, than when it is acting wdth the remainder of 
the dynamic compound. If a reaction is conditioned to the com- 
pound, the reaction potential of a given component, because of the 
generalization gradient, is less when it is acting separately than 
when in the compound. During the differential reinforcement 
which produces this kind of learning, the generalized excitatory 
potential of the components is extinguished, developing inhibition 
in proportion to the reaction strength of each component. This in- 
hibitory potential generalizes back upon the compound but, again, 
with a reduction due to the generalization gradient. The resulting 
net loss to the reaction potential at the command of the stimulus 
compound is much less than its original reaction potential; this 
ordinarily leaves the stimulus compound an amount of effective 
reaction potential which is well above the reaction threshold. This 
difference is the basis of the discrimination, i.e., of successful pat- 
terning. While the details of positive and negative simultaneous 
stimulus patterning and of positive and negative temporal stimulus 
patterning differ slightly, they all conform in substance to the sum- 
mary account just given. The process of genuine stimulus pattern- 
ing thus turns out to be at bottom the learning to discriminate 
afferent interaction effects. 

By an analogous use of the same set of postulates it is possible 
also to deduce the phenomenon not only of genuine stimulus pat- 
terning, but also of quasi or spontaneous stimulus patterning, and, 
other things equal, the amount of both kinds of patterning as an 
increasing fimction of the degree of the afferent interaction effects 
mutually generated by the components. Other deductions are to 
the effect that positive patterning will be easier to learn than 
negative, that simultaneous compounds will be easier to pattern 
than temporal, and that the longer the time interval separating 
iie components in a temporal stimulus compound, the more difficult 
will be the patterning. The same postulates afford a rather detailed 
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explanation of why a musical note to which a reaction has been 
conditioned fails to evoke the reaction when given in the midst of 
an arpe^o, the remaining notes of which have been made negative 
by differential reinforcement. All of these deductions are in sub- 
stantial agreement with such empirical observations as are at pres- 
ent available. 

If the organism could be certain of the occurrence of primary 
receptor discharges corresponding to every relevant condition in 
its environment which contributes to the determination of whether 
a given response will be followed by reinforcement, all responses 
in hi^-grade organisms might ultimately be made only to pat- 
terned stimulus compounds. But since under life conditions many 
elemente which determine reinforcement do not activate any recep- 
tor, fhe basis for complete and exact patterning is frequently lack- 
ing; the organism must accordingly gamble on the outcome, often 
with ifa very life at stake. However, the processes of biological 
evolution seem to have produced a fairly satisfactory non-pattemed 
arrangement for meeting this contingency. 

Analysis suggests that the magnitude of ordinary neural inter- 
action effects is such as to produce a fall of less than 60 per cent 
in the generalization gradient. Under these conditions the afferent 
impulse arising from a stimulus aggregate will tend to evoke the 
same reaction both alone and within the compound. When in the 
compound, the habit strengths (or reaction potentials) commanded 
by different stimulus aggregates presumably summate by a kind of 
diminishing returns principle. As a consequence, and quite apart 
from any patterning, the fewer the stimulus elements conditioned 
to a given reaction which chance to be present in a given stimulus 
compound, the smaller the reaction potentiality will be. This is 
believed to be a kind of crude but automatic biological calculus of 
tite probability that a reaction evoked xmder given circumstances 
will be followed by reinforcement. 

On the basis of the forgoing considerations we now formulate 
two imiwnl^t <K)rollari^: 

MAJOR COROLLARY IV 

reliffc^rcemecit ap^ed to caonultaneous stunulus com- 
i^^uRs In ike ^iteromg of such compoimds, eitiher positive 
Of to whetii^ the compottiid or die components are 
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MAJOR COROLLARY V 

Differential reinforcement applied to temporal stimuius compounds 
results in the patterning of such compounds, either positively or negatively 
according to whether the compound or the components are reinforced. 

NOTES 

The Formula for Calculating the Empirical Degree of Patterning in the 
Case of Reactions Which Are Not of the Ali-or-None Type 

The formula for P (Principle t), while applying fairly well to reactions of the 
aJl-or-none type, definitely does not apply to a response such as the galvanic 
skin reaction, or the salivary reaction, whose amplitude varies with the magnitude 
of the reaction potential. A distinctly unsuccessful attempt at a formula for 
this latter type of reaction tried out in an earlier study (4) was, 

p _ El + Eg 

Ei + * ’ 

where Ri is the amplitude of the reaction evoked by one stimulus component, and 
El is the amplitude of that evoked by a second. Ui^ortunately, the determinarion 
of a suitable formula for the calculation of the extent of empirical patterning 
with this type of reaction requires a knowledge of the physiological limit of the 
amplitude of its conditioned evocation; to find this would probably be a very 
laborious procedure (4 p. 108 ff.). 

The Patterning of Stimulus Compounds and the Configuration Psycholc^es 

After stud3?ing the above chapter the reader may naturally ask what the 
relation of the present behavioristic treatment of the configurational problem in 
learning is to that put forward by the Wertheimer branch of the Gestalt schooL 
While much might be said on this subject, the few words possible to devote to it 
in this place may help to clarify the reader's understanding. 

Gestalt Theorie asserts that configurations are not only Ic^caJly primaiy but 
that they are somehow primordiaL Indeed, if current configurationism is evar 
formulated as a true scientific theory, so that its primaiy and secondary piindpte 
can be clearly distinguished, it is rather likely that a statement asserting the 
reality and nature of configurations will be revealed as its sole primary principle 
or postulate. The present work, on the other hand, undertake to demonstrate 
that the response of organisms to stimulus configurations is logically s€NX)ndaiy, 
that it is the result of a rather complex process of learning which is ii[^<fiated by 
the behaviorally primary processes of (1) afferent neural interacrion, (2) 
severative stimulus trac^ (3) reinforcement, (4) genmliaation of reacrima 
potential, (5) experimental extinction, and (6) generalisation of inhibitaon. 

Gestalt writers frequently leave the impression that an ad^uate derivation 
of the reaction of organisms to stimulus configurations is a priori impassable 
from bcbavioristic or ndn-consciousness piindples. position of the prt^^t 
work is that such a derivation is not only posable, but rdatively rimpfe and 
straightforward. Moreover, the preceding pages have jmesented a numb^ of 
such deductions, thereby showing the Ge^aJt a prion dLaims to be ro fe r ta ka a . 
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Meanwhile it remains to be seen whether Gestalt Theorie can itself mediate com- 
parable deductions. Clearly, no dispute exists as to the genuineness or the 
importance of the patterning of stimulus compounds; the difference of opinion 
concerns, rather, the logical question of whether stimulus patterning is a primary 
or a secondary principle. There are, of course, other differences between Gestdli 
Theorie and the present approach, but they do not particularly concern us here. 

It is hoped that the derivation of the major phenomena of stimulus-pattern 
learning from objective, non-consciousness principles as demonstrated above, 
will contribute to the dissipation of current misunderstandings among psychob 
ogists, since these are a source of such deep and painful confusion to the scientific 
public. However, optimism in this connection is seriously dampened by the 
conviction that the differences involved arise largely from a conflict of cultures 
(d, pp. 18, 685 ; 7, p. 30) which, unfortunately, are extra-scientific and are not 
ordinarily resolvable by either logical or empirical procedures. 
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CHAPTER XX 


General Summary and Conclusions 

In the foregoing chapters we have made a detailed examination 
of much experimental evidence, and we have considered the merits 
of many alternative interpretations. Such complications, while 
unavoidable in a work of this kind, necessarily tend to obscure an 
integrated view in which the various components of the subject 
have their proper significance. No doubt the reader has sighed 
more than once for the simplicity of dogmatic aflSrmation and for 
the over-all perspective attainable by brevity. In the present 
chapter we shall endeavor to make a clarifying integration of the 
major conclusions scattered through the preceding exposition. 

THE NATIJEE OF SCIENTIFIC THEOKY 

The major task of science is the isolation of principles which 
shall be of as general validity as possible. In the methodology 
whereby scientists have successfully sought this end, two proced- 
ures may be distinguished — ^the empirical and the theoretical. The 
empirical procedure consists primarily of observation, usually facili- 
tated by experiment. The theoretical procedure, on the other hand, 
is essentially logical in nature; through its mediation, in conjunc- 
tion with the emplo3rment of the empirical procedure, the range of 
validity of principles may be explored to an extent quite impcH^ible 
by the empirical procedure alone. This is notably the case in 
situations where two or more supposed primary principles are pre- 
sumably operative simultaneously. The logical procedure yields a 
statement of the outcome to be expected if the several principle 
are jointly active as formulated; by comparing deduced or theo- 
retical conclusions with the observed empirical outcome, it may be 
determined whether the principles are general enough to cover the 
situation in question. 

Scientific theory in its ideal form eonsiste of a hiemrehy of 
cally deduced propositions which parallel aU tiie oteerved empirical 
relationshii^ competing a science. ITiis Ic^cal fracture is derived 
from a relatively small number of self-coiimstent primary prin- 
ciples called poskilates, when taken in conjunction with relevant 
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antecedent conditions. The behavior sciences have been slower 
than the physical sciences to attain this systematic status, in part 
because of their inherent complexity, in part because of the action 
of the oscillation principle, but also in part because of the greater 
persistence of anthropomorphism. 

Empirical observation, supplemented by shrewd conjecture, is 
the main source of the primary principles or postulates of a science. 
Such formulations, when taken in various combinations together 
with relevant antecedent conditions, yield inferences or theorems, 
of which some may agree with the empirical outcome of the con- 
ditions in question, and some may not. Primary propositions yield- 
ing logical deductions which consistently agree with the observed 
empirical outcome are retained, whereas those which disagree are 
rejected or modified. As the sifting of this trial-and-error process 
continues, there gradually emerges a limited series of primary prin- 
ciple whose joint implications are progressively more likely to 
agree with relevant observations. Deductions made from these 
surviving postulates, while never absolutely certain, do at length 
become highly trustworthy. This is in fact the present status of 
the primary principles of the major physical sciences. 

BEHAVIOR THEORY AND SYMBOLIC CONSTBTJCrS 

Scientific theories are mainly concerned with dynamic situa- 
tions, i.e., with the consequent events or conditions which, with the 
pa^ge of time, will follow from a given set of antecedent events 
or conditions. The concrete activity of theorizing consists in the 
manipulation of a limited set of symbols according to the rules 
expre^ed in the postulate (together with certain additional rules 
which make up the main substance of logic) in such a wa,j as to 
^an the gap separating the antecedent conditions or states from 
the mibsequent ones. Some of the S3anbols represent observable 
aiui measurable elements or aggregate of the situation, whereas 
others reprint presumptive intervening processes not directly sub- 
let to oteervation. The latter are theoretical constructs. All well- 
develof^ science freely employ theoretical constructs wherever 
they prove useful, sometimes even sequences or chains of them. The 
mmtific utility of lo^cal constructs consists in the mediation of 
valid deductioi^; this in turn is ab^lutely dependent upon every 
instruct chain, being securely anchored both on the 
iffite^ent imd <m the con^quent side to conditions or events 
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Fig. 84. Diagram summarizing the major symbolic constructs (encircled 
symbols) employed in the present system of behavior theory, together with 
the S3mibols of the supporting objectively ofc^rvable conditions and events. 
In this diagram S represents the physical stimulus energy involved in learn- 
ing; R, the organism^s reaction; I, the neural result of the stimulus; I, the 
neural interaction arising from the impact of two or more stimulus com- 
ponents; r, the efferent impulse leading to reaction; G, the occurrence of a 
reinforcing state of affairs; sHjt, habit strength; S, evocation stimulus on the 
same stimulus continuum as S; the generadized habit strength; Cd, the 
objectively ot^rvable phenomena determining the drive; D, the physiological 
strength of the drive to motivate acti<m; the reaction i)otentiaI; W, 
work involv^ in an evoked reaction; 1b, imctive inhibition; sis, (^nditioned 
inhibitimi; jgfe, effective reaction potential; sOs, oscillation; momentaiy 
effective reaction potential; sLs, ruction threshold; p, prdmbility of reaction 
evocation; d&, ktency of reaction evocation; n, number of unreinfort^ rela- 
tions to produce experiment extinction; and A, ampHtude of reaction. 
Above the symbofe the lines beneath the words remfort^ment, ffenera&ation, 
motivation, mM>Uicm, osf^ation, and rmponm ovocaikm indicate rmi^ly 
segment of the chain of eymbcdm witti whMi proe^ is 


which am direcUy If po^ble^ dicmld 

measursble. 

The theory of behaYkr to i^nwe tiie of a number 
of symbolic constructs, arrmiged for the mc^t part in a single 
chain. The main links of this chain are rep^resented in Figure SL 
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In the interest of clarity, the symbolic constructs are accompanied 
by the more important and relevant symbols representing the ob- 
jectively anchoring conditions or events. In order that the two 
types of symbols shall be easily distinguishable, circles have been 
drawm around the symbolic constructs. It will be noticed that the 
sj^mbols representing observables, while scattered throughout the 
sequence, are conspicuously clustered at the beginning and at the 
end of the chain, where they must be in order to make validation 
of the constructs possible. Frequent reference will be made to this 
summarizing diagram throughout the present chapter, as it reveals 
at a glance the groundwork of the present approach to the behavior 
sciences. 

organisms (xinceived as self-maintaining mechanisms 

From the point of view of biological evolution, organisms are 
more or less successfully self-maintaining mechanisms. In the 
present context a mechanism is defined as a physical aggregate 
whose behavior occurs under ascertainable conditions according to 
definitely statable rules or laws. In biology, the nature of these 
aggregates is such that for individuals and species to survive, cer- 
tain optimal conditions must be approximated. When conditions 
deviate from the optimum, equilibrium may as a rule be restored 
by some sort of action on the part of the organism; such activity 
is described as '^adaptive.” The organs effecting the adaptive 
activity of animals are for the most part glands and muscles. 

In higher organisms the number, variety, and complexity of 
the acts required for protracted survival is exceedingly great. The 
nature of the act or action sequence necessary to bring about opti- 
mal conditions in a given situation depends jointly (1) upon the 
state of disequilibrium or need of the organism and (2) upon the 
characteristics of the environment, external and internal. For this 
rea^m a prerequisite of truly adaptive action is that both the con- 
dition of the organism and that of all relevant portions of the 
environment must somehow be brought simultaneously to bear on 
tile reactive organs. The first link of this necessary functional 
rapport of the effector organs with organismic needs and environ- 
mmtal conditions is constituted by receptors which convert the 
biolc^caDy more important of the environmental energies (S) into 
mmral impils^ (s). For the mc^t part these neural impulses flow 
to In^n, which actB as a kind of automatic switchboard medi- 
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ating their efferent flow (r) to the effectors in such a way as to 
evoke response (i?). In this connection there are two important 
neural principles to be noted. 

The first of these principles to be observed is that after the 
stimulus (S) has ceased to act upon the receptor, the afferent im- 
pulse (s) continues its activity for some seconds, or possibly min- 
utes under certain circumstances, though with gradually decreasing 
intensity. This perseverative stimulus trace is biologically im- 
portant because it brings the effector organ en rapport not only 
with environmental events which are occurring at the time but with 
events which have occurred in the recent past, a matter frequently 
critical for survival. Thus is effected a short-range temporal inte- 
gration (Postulate 1, p. 47) . 

The second neural principle is that the receptor discharges and 
their perseverative traces (s) generated on the different occasions 
of the impact of a given stimulus energy (S) upon the receptor, 
while usually very similar, are believed almost never to be exactly 
the same. This lack of uniformity is postulated as due (1) to the 
fact that many receptors are activated by stimulus energies simul- 
taneously and (2) to '^afferent neural interaction.” The latter 
hypothesis states that the receptor discharges interact, while pass- 
ing through the nervous system to the point w^here newly acquired 
receptor-effector connections have their locus, in such a way that 
each receptor discharge changes all the others to a greater or le^ 
extent; i.e., s is changed to li, Ss, or Sgj etc., in accordance with 
the particular combination of other stimulus energies which is act- 
ing on the sensorium at the time (see Figure 84). This type of 
action is particularly important because the mediation of the 
responses of organisms to distinctive combinaticms or patterns of 
stimuli, rather than to the components of the patterns, is pri^um- 
ably dependent upon it (Postulate 2, p. 47). 

The detailed physiological principles whereby the nervous sys- 
tem m^ates tiie behavioral adaptation of the organism are as yet 
far from completely known. As a result we are forc«i for the mc^ 
part to get along as best we can with relatively coai^ molar formu- 
lations derived from conditioned-reflex and other behavior experi- 
ments. From this point of view it appears that the piwes^ of 
organic evolution have yielded two distinct but clo^ly related means 
of effective behavioral adaptation. One of these is the laying down 
of unlearned receptor-effector connections within the neural 

tissue w^hich wull directly mediate at least approximate behavioral 
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adjustments to urgent situations which are of frequent occurrence 
but which require relatively simple responses (Postulate 3, p. 66). 
The second means of effecting behavioral adjustment is probably 
evolution's most impressive achievement; this is the capacity of 
organisms themselves to acquire automatically adaptive receptor- 
effector connections. Such acquisition is learning. 

mmNING AKD THE PROBLEM OF ^BEINFORCEMEN-T 

The substance of the elementary learning process as revealed 
by much experimentation seems to be this: A condition of need 
exists in a more or less complex setting of receptor discharges 
initiated by the action of environmental stimulus energies. This 
combination of circumstances activates numerous vaguely adaptive 
reaction potentials mediated by the unlearned receptor-effector 
organization (sUb) laid down by organic evolution. The relative 
strengths of these various reaction potentials are varied from in- 
stant to instant by the oscillation factor (sOr). The resulting 
spontaneous variability of the momentary imlearned reaction poten- 
tial produces the randomness and variability of the unlearned 
behavior evoked under given conditions. In case one of these ran- 
dom responses, or a sequence of them, results in the reduction of 
a need dominant at the time, there follows as an indirect effect 
what is known as reinforcement (G, of Figure 84) . This consists 
in (1) a strengthening of the particular receptor-effector connec- 
tions which originally mediated the reaction and (2) a tendency 
for all receptor discharges (s) occurring at about the same time 
to acquire new connections with the effectors mediating the 
r^pon^ in question. The first effect is known as primitive trial- 
and-error learning; the second is known as conditioned-reflex learn- 
ing. In most adaptive situations both processes occur concur- 
rmtly; indeed, very likely they are at bottom the same 'process, 
differing only m the accidental circumstance that the first begins 
with an appreciable strength, whereas the second sets out from zero. 
As a imilt, whai the same need again arises in this or a similar 
^tuayon, the stimuli will activate the same effectors more c^- 
tainly, moi^ promptly, and more vigorously than on the first occa- 
mon. Such action, while by no means adaptively infallible, in the 
long run will r^uce the need more surely than would a chance 
^mfding of the unl^med respond teidencies (sUr) at the com- 
m^d of other nmi and stimulating situations, and more quickly 
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and completely than did that particular need and stimulating 
situation on the first occasion. Thus the acquisition of such 
receptor-effector connections will, as a rule, make for survival; 
i.e., it will be adaptive. 

Careful observation and experiment reveal, particularly with 
the higher organisms, large numbers of situations in which learning 
occurs with no associated primary need reduction. When these 
cases are carefully studied it is found that the reinforcing agent 
is a situation or event involving a stimulus aggregate or compound 
which has been closely and consistently associated with the need 
reduction. Such a situation is called a secondary reinforcing agent, 
and the strengthening of the receptor-effector connections which 
results from its action is known as secondary reinforcement. This 
principle is of immense importance in the behavior of the higher 
species. 

The organization within the nervous system brought about by 
a particular reinforcement is known as a habit; since it is not 
directly observable, habit has the status of a s3mbolic construct. 
Strictly speaking, habit is a functional connection between s and r; 
it is accordingly represented by the symbol sHr^ Owing, however, 
to the close functional relationship between S and s on the one 
hand, and between r and R on the other, the symbol bSjr wiU serve 
for most expository purposes; the latter symbol has the advantage 
that S and R both refer to conditions or events normally open to 
public observation. The position of in the chain of constructs 
of the present system is shown in Figure 84. 

While it is difficult to determine the qumititative value of an 
tmobservable, various indirect considerations combine to indicate 
as a first approximation that habit strength is a simple increasing 
growth function of the number of reinforcements. The unit ch€«s^ 
for the expre^ion of habit strength is calM the Aa6, a shorten^ 
form of the wmd ^^habit^'; a hab is 1 per cent of the physiological 
Mmit of habit strength under completely optimal conditior^. 

COOT>mOHS WWiCH IKFLXmNm THE MAOHITOIWi OF BLABTF 
iNcnEMiOT PER 

A more careful scrutiny of the condifions of reinfor^ment 
veals a number which are subject to variation, and eq>eriment8 
have shown that the magnitude of the habit incranmt (AgHj^) 
per reinforcement is dependent in one way or another ufKjn the 
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quantitative variation of these conditions. One such factor con- 
cerns the primary reinforcing agent. It has been found that, 
quality remaining constant, the magnitude of the increment of 
habit strength per reinforcement is a negatively accelerated in- 
creasing function of the quantity of the reinforcing agent employed 
per reinforcement. 

A second factor of considerable importance in determining the 
magnitude of AsHr is the degree of asynchronism between the 
onset of the stimulus and of the response to which it is being con- 
ditioned, This situation is complicated by whether or not the 
stimulus terminates its action on the receptor before the response 
occurs. In general the experimental evidence indicates that in 
case both the stimulus and the response are of very brief duration, 
the increment of habit strength per reinforcement is maximal when 
the reaction (and the reinforcement) occurs a short half second 
after the stimulus, and that it is a negatively accelerated decreas- 
ing function of the extent to which asynchronisms in either direc- 
tion depart from this optimum. In case the reaction synchronizes 
with the continued action of the stimulus on the receptor, the incre- 
ment of habit strength per reinforcement is a simple negative 
growth function of the length of time that the stimulus has acted 
on the receptor when the reaction occurs. 

A third important factor in the reinforcing situation is the 
length of time elapsing between the occurrence of the reaction and 
of the reinforcing state of affairs {G, Figure 84). Experiments 
indicate that this ^‘gradient of reinforcement^' is a negatively accel- 
erated decreasing growth function of the length of time that rein- 
forcement follows the reaction. The principle of secondary rein- 
forcement, combined with that of the gradient of reinforcement, 
explains the extremely numerous cases of learning in which the 
primary reinforcement is indefinitely remote from the act rein- 
fore^. A considerable mass of experimental evidence indicates 
that a kind of blending of the action of these two principles gen- 
erates a secondary phenomenon called the ^^goal gradient.” Upon 
empirical investigation this turns out to be a decreasing exponen- 
tial or negative growth function of the time (t) separating the 
reaction from the primary reinforcement for delays ranging from 
ten seconds to five or six minutes; delays greater than six minutes 
have not yet been suflGlciently explored to make possible a quan- 
titative statement concerning them. 

There are doubtlm other conditions which influence the magni- 
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tude of the increment of habit strength resulting from each rein- 
forcement. Those listed above certainly are typical and probably 
comprise the more important of them. An adequate statement of 
the primary law or laws of learning would accordingly take the 
form of an equation in which sHr would be expressed as a joint 
function not only of N but of the quantity and quality of the 
reinforcing agent, and of the temporal relationships of S to 72 and 
of i? to Ct. a formula which purports to be a first approximation 
to such a general quantitative expression of the primary laws of 
learning is given as equations 16 and 17, pp. 178-179. 

STIMULUS GEimiALIZATION' 

With the primary laws of learning formally disposed of, we 
proceed to the consideration of certain dynamical principles accord- 
ing to -which habits, in conjunction with adequate stimulation (S) 
and drive (D), mediate overt behavior. In this connection we note 
the fact that a stimulus (5, Figure 84)-, through its afferent im- 
pulses (s, represented in Figure 84) will often evoke the reac- 
tion (J2) even though s may be rather different from $ or I, the 
receptor impulse originally conditioned to jB. This means that 
when a stimulus (S) and a reaction (i?) are conjoined in a rein- 
forcement situation, there is set up a connection not only to the 
stimulus involved in the reinforcement but to a whole zone of other 
potential stimuli lying on the same stimulus continuum, such as 
Si, S 2 , Ss, and so forth. This fact, known as stimulus generaliza- 
tion, is of immense adaptive significance; since stimuli are rarely 
if ever exactly repeated, habits could scarcely function adaptively 
without it. 

Stimulus generalization has the characteristic that in general 
■the greater the physical deviation of S from S, the weaker will 
the habit stirength which is mobilized. More precisely, the strength 
of a generalized habit (sBs) is a linear increasing function of the 
sfarength of the habit at the point of reinforcement and a negatively 
accelerate decreasing function of the difference (d) between S 
and S as measured in di^iimination threrfK>lds Cjji.d.^s). Thus 
sHr is a theoretical construct anchored to the consbruct 
to the observables S and S (see Figure 84) . 

Stimulus generalization appears to take two forms — (1) quali- 
tative stimulus generalization and (2) stimulus intensity general- 
ization; another way of stating the same thing is to say that each 
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more rapid than is the ordinary loss of learning effects. This would 
produce the phenomenon of reminiscence which has been especially 
studied in rote learning. 

Since the presence of Ir constitutes a need, the cessation of 
the activity which generated the need would initiate the need- 
reduction process; but since need reduction is the critical element 
in reinforcement, there follows with fair plausibility the molar 
principle that cessation of the activity would be conditioned to any 
stimuli which are consistently associated with such cessation (Pos- 
tulate 9, p. 300). But a tendency to the cessation of an act would 
be directly inhibitory to the performance of that act. Therefore 
the inclusion of such an inhibitory stimulus in a stimulus com- 
pound, the remainder of which is positively conditioned to the 
response, would tend to prevent the evocation of the response in 
question; this is, in fact, the ordinary empirical test for conditioned 
inhibition [bIr, Pigure 84) . 

On the above view that rIr is a negative habit, the injection of 
alien stimuli into the stimulus compound would, through the prin- 
ciple of afferent interaction, produce disinhibition, i.e., a temporary 
reduction or total abolition of rIr, But on the assumption that 
sIr is being set up during the process of accumulating Ir, it follows 
that the total inhibition {Ir) at the conclusion of experimental 
extinction must be in part Ir and in part For this reason 
disinhibition will take place only in so far as Ir is composed of 
sIr. and spontaneous recovery will take place only in so far as Ir 
is composed of this means that neither disinhibition nor spon- 
taneous recovery can ever restore an extinguished reaction poten- 
tial to its full original strength. Other implications which flow 
from the above ^sumptions are that there is greater economy in 
distributed than in massed repetitions in rote learning, and that, 
other things equal, organisms receiving the same reinforcement fol- 
lowing two respons<^ which require different energy expenditures 
will, as practice continues, gradually come to choose the less labori- 
ous r^onse. This is the 'flaw of less work.” 

Implicit in the preceding discussion has been the assumption 
that the reaction potential actually available for reaction evoca- 
tion, i.a, the effective reaction potential {^Ry Figure 84) , is what 
remains of ihe reaction jK)tential {bEr) after the subtraction of the 
total inhibition, Ir; i.e., 


^R = ^R Ir, 
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Since both and /js are anchored to objectively observable ante- 
cedent conditions, it follows that is also thus anchored. 

THE OSCILLATION OF EFFECTIVB REACTION POTENTIAL 

At this point it must be noted at once that the full value of 
is rarely brought to bear in the evocation of action. Instead 
it is subject to random or chance downward variability. These 
fluctuations are believed to be due to a little-understood physio- 
logical process which has the power of neutralizing reaction poten- 
tials to degrees varying from moment to moment. Because of this 
latter characteristic, the process is called ^^oscillation”; it is repre- 
sented by the symbol sOr. Effective reaction potential as modified 
by oscillation is called “momentary effective reaction potential”; 
this is represented by the symbol 

Since bOr is not directly observable, it has something of the 
status of a symbolic construct; on the other hand, owing to its pre- 
sumably constant value, it has less elusiveness than an ordinary 
construct; it is therefore not placed in a circle in Figure 84. The 
hypothetical characteristics of sOb ^^7 be listed as follows: 

1. It is active at all times. 

2. It exerts an absolute depressing action against any and all reaction 
potentials, whether great or small. 

3. The magnitude of this jwtentiality varies from instant to instant 
accordmg to the normal probability distribution. 

4. The magnitude of its action on different reaction potentials at a 
given instant is uncorrelated (Postulate 10, p. 319). 

Since oscillation is continuously active all ruction poten- 
tials, it plays a very great role in adaptive bdhavicnr. It pre- 
sumably is responmble for many erf the phenomena grouj^ ly ihe 
Clascal i^chol<^isls und^ ”Bie b^d of "attentiem.” It is in 
large measure r^)oimWe i<xt the fact that the ^haJ ^ien^ mu^ 
p<K)I mmj ol^rvatioQS bef<xpe orefinary empirical laws may b^nae 
manifest, llius nataial laws in the social teienees mu^ always 
he bas^ on stafetical indict of om kind or another. TMb in 
its turn has induced much preoceupaiion with statistical methc^ 
on the part of the various behavior science. The nm^ity <rf 
pcKjling large numbers of ol^rvations in older to i^lato rapirical 
laws has greatly increa^i the labor a^Kjiated with empirical in- 
ve^gations and has doubtl^ appr^iably retarded the develop- 
m^t of the behavior ^imum 
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THE REACTION THRESHOLD AISB RESPONSE EVOCATION 

The anchoring on the posterior or consequent side of our chain 
of behavioral constructs culminating in as shown in Figure 84, 
lies in the evocation of observable reactions. In the determination 
of the functional relationship of to the various measurable 
phenomena of responses, we encounter special diflSculties owing to 
the fact that sEb is itself not directly observable. If we were 
quite sure of the quantitative functional relationship of sEb to its 
combination of antecedent anchors, the value of sEb could be cal- 
culated in empirical situations and equations could then be fitted 
to the relationship of these numbers to the corresponding response 
values; these equations are what we seek. Unfortunately the nec^- 
sary antecedent functional relationships are not yet known with 
sufficient certainty. 

It happens, however, that in t3rpical sets of simple learning 
results, employing the four measurable response phenemona, the 
fitted equations in all cases are easily and naturally expressible 
by equations involving the simple positive growth (exponential) 
function of the number of reinforcements (iV). This tends some- 
what to confirm the soundness of the general growth hypothesis 
of the relation of iV to bHb, and so of N to bEb- Further inde- 
pendent confirmation of the soundness of the growth hypothesis 
of the relation of sEb to N, lies in the following fact: when the 
probability-of-reaction-evocation type of learning curve is ana- 
lyzed theoretically, it turns out to be yielded in a degree of detail 
scarcely attributable to chance on the above assumption of the 
relation of sEb to N coupled with two additional assumptions, 
each well supported by independent evidence — ^that of the oscil- 
lation function (sOr) and that of the reaction threshold (sEb) 
(see Figure 84). The characteristics of the oscillation function 
have been summarized above. Moreover, the concept of the reac- 
tion threshold is well established, since notions fairly comparable 
to it have long been current in classical psychophysics and in 
physiology. As here employed, the reaction threshold (sLb) is 
th^ minimal amount of momentary effective reaction potential 
(sEr) which is nece^ary to mediate reaction evocation when the 
sitaation is uncomplicated by competing reaction potentials (Pos- 
telate 11, p. 344). 

Acting, then, on tibe fairly well-authenticated growth hypothesis 
of the rekticm of sEr to V, it is a relatively simple matter, by 
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inspecting the equations fitted to concrete example of the tiitee 
remaining types of learning curves and utilizing the method of 
residues, to determine the fimctional relationship of to the 
particular behavior phenomena employed. As a result of this pro- 
cedure it is concluded that probability of reaction evocation stands 
in an ogival relationship to effective reaction potential (Postulate 
12, p. 344) ; that reaction latency stands in a negatively accelerated 
inverse relationship (Postulate 13, p. 344) ; and that both resist- 
ance to experimental extinction and reaction amplitude (of auto- 
nomically mediated responses) are increasing linear functions of 
sWb (Postulates 14 and 15, p. 344). 

A final complication concerning reaction evocation aris^ from 
the fact that often the stimulus elements impinging on the receptors 
at a given instant may mobilize superthreshold reaction potentials 
to several different reactions, some or all of which may be mutually 
incompatible. In such cases all but the strong^ will nec^sariiy 
suffer associative inhibition (Postulate 16, p. 344). There are al^ 
some indications that the dominant potential itself may suffer a 
certain amount of blocking; indeed, this is the basis of the most 
plausible theory of ^Torgetting^^ now available. 

This concludes our summary of primary principles. All of the^ 
principles are also statable in the form of quantitative equatiom. 
This means that if the antecedent conditions S, s, R, G, t, S, I, 
Cd, Wj sObj and sLb were known, it would be possible to compute 
P, sts? or A by substituting appropriately in a succession of th^ 
equations beginning on the left-hand side of Kgure 84 and pro- 
ceeding toward the right. For example, the calculation of 
would employ equations 16, 34, 44, 45, and 4&. 

-DYNAMICS OF STIMULUS <X)MK)UNI^ AND PATUSNS 

For the most part the molar principles outlined in the preen- 
ing chapters are presumably primary in nature, though occasitmal 
secondary principles have been pr^mtn. Because of their rela- 
tively primitive status in the logical hierarchy of the ^stem md 
of their specially intimate relation to survival, a few ^eondary 
principle or mechanisms have beai ^>nal c^asideration 

and have bean listed as ^^major coroUari^/^ One of th^ concerns 
the quantitative summation of the reaction potentials mobilized by 
the several stimulus components of a stimulus <x>mpound, and the 
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other concerns the matter of stimulus patterning. We shall first 
take up the matter of the summation of reaction potentials. 

In spite of the presumptive fact of afferent neural interaction, 
the afferent discharge of each receptor contains a large amount of 
similarity regardless of the influence of other stimulus elements 
(and receptors) which may be active at the time. This means 
that any stimulus component conditioned to a reaction will ordi- 
narily command an appreciable potentiality to that reaction regard- 
less of the other stimuli accompanying it. Now, according to the 
primary law of learning, each individual receptor discharge bears 
its load of habit strength, and so of reaction potential. The reac- 
tion-potential loadings thus borne by the several receptor dis- 
charges initiated by the different stimulus elements of a stimulus 
compound presumably combine quantitatively in the same way 
as do the different increments of habit strength, i.e., not by a simple 
addition but according to a kind of diminishing-retums principle. 
Thus if two stimulus aggregates bearing equal loads of reaction 
potential to the evocation of the same response are acting simul- 
taneously as a stimulus compound, their physiological summation, 
quite apart from afferent interaction effects, will be less than the 
arithmetical sum of the two reaction potentials; similarly, if one of 
the two equally loaded stimulus aggregates making up a stimulus 
compound should be withdrawn from the compound, more than 
half of the total reaction potential would remain. As a result 
(except for afferent neural interaction effects), the more completely 
a reinforced stimulus compound is repeated on a subsequent occa- 
don, the more likely it will be to evoke the reaction in question. 

This mode of action has special adaptive significance, because 
the more completely the stimulus compound is repeated, the more 
mmilar will be the environmental situation in general to the situa- 
tion in which need reduction originally occurred, and therefore the 
more probably will the response in question lead again to a reduc- 
tion in the need. Here we have a primitive automatic mechanism 
which in effect roughly gauges the probability of a given stimulus 
situation^ yielding need reduction in case a given response is 
evok^. This adaptive mechanism has the great advantage of 
being instantly available at the presentation of any stimulus situa- 
tion, novel or otherwise. 

The other mjondary principle mediating the rei^nse of organ- 
^ms to stimulus compounds, winch we have included in the present 
work, is tiiat known as patterning. This operates concurrently 
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with the summation principle just discussed but is much slower in 
its action. However, given sufficient time for the rather difficult 
learning process to take place, stimulus patterning may be very 
precisely adaptive. It is a fact that in very large numbers of 
situations the question of whether or not a given response will be 
followed by reinforcement depends upon the presence or absence 
of a particular combination of physical circumstances and so, for 
the organism, upon a particular combination or pattern of stimulus 
elements, rather than upon the presence or absence of any of the 
components. Since each combination of stimulus elements will 
modify to some extent the afferent impulses produced by each 
stimulus component, any change in the stimulus compound wdll 
also modify to some extent the afferent responses initiated by all 
the remaining stimulus components. In the process of the irregular 
alternation of reinforcement and extinction called differential rein- 
forcement, which is characteristic of the form of trial and error 
known as discrimination learning, higher organisms are able to 
emerge with one response successfully conditioned to one combina- 
tion of stimuli and with a quite different response successfully 
conditioned to another combination of stimuli containing many of 
the components of the first, provided some of the elements are dif- 
ferent, At bottom this discrimination is possible because the 
afferent impulse St which arises from the stimulus element when 
occurring concurrently with the stimulus element Sg, is to some 
extent different from I3, which arises from the same stimulus ele- 
ment, Si, wffien occurring concurrently with a different stimulus 
element, Ss. The physiological summation of the several com|X>- 
nent reaction potentials characteristic of various stimulus patterns 
which have many, and even most, of their stimulus elements in 
common, accordingly may result in the evocation without can- 
fusion of the distinctive reaction conditioned to each. Thus each 
of the forty or so elementary speech sounds is a fairly distinctive 
pattern made up of a “fundamental” physical vibration rate and 
a particular combination of higher partials. Each of ihe thousands 
of words of the better-developed languages consists of a temix>rally 
patterned sequence of these elementary speech sounds, stops, and 
so forth. In reading, each letter is a cmnpkx visual pattern, each 
word is a complex pattern of th^ letter patterns, and each ^ntence 
is a temporally patterned sequence of printed word patterns. In- 
deed, it is imjx>ssible to think of a life situation which is not pat- 
terned to a considerable extent. The limiting case of this kind of 
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learning is that in which a stimulus compound is conditioned to 
evoke a reaction while the several components when acting alone 
are consistently extinguished. 


A FORWARD GIAKCE 

The main concern of this work has been to isolate and present 
the primary or basic principles or laws of behavior as they appear 
in the current state of behavioral knowledge; at present there have 
been isolated sixteen such principles. In so far as these principle 
or postulates are sound and sufiBcient, it should be possible to 
deduce from them an extensive logical hierarchy of secondary prin- 
ciples which will exactly parallel all of the objectively observable 
phenomena of the behavior of higher organisms; such a hierarchy 
would constitute a systematic theory of all the social sciences. Con- 
siderable progress has been made in this direction (I, 2, S, 4, 

7, 8, 9, 10, 11, 12, IS, U, 15, 16, 17, 18, 19, 20, 22, 23, 24, 35, 26, 
27, 28, 29), though because of the limitations in available space 
only a random sampling of some fifty or so secondary principles 
(corollaries) is included in the present volume; these are given 
chiefly for purposes of illustrating the meaning of the primary prin- 
ciples. 

As the systematization of the behavior sciences proceeds, some 
of the principles put forward above as primary will be found to 
yield false deductions and will therefore be abandoned; some will 
be discarded as primary principles because found derivable from 
other primary principles and consequently will be placed in the 
group of secondary principles; others will be found partially defec- 
tive and will require modification; finally, entirely new postulate 
win need to be added. The primary principles presented in the 
pr^eding pag^ have been formulated with the certainty of these 
future developments fully in mind. A sharp and definite formula- 
ticm h^ in many cases, been ^ven principles despite admitted 
doubt as to their precise validity. It is believed that a clear formu- 
lation, even if later found incorrect, will ultimately lead more 
quickly and ^^ly to a correct formulation than will a pussyfoot- 
ing staten^t which might be more difficult to convict of falsity. 
The primary of a science is the early and economical discovery 
of ite base laws. In the view of the ^ientifically sophisticatol, 
to no^e an ineorr^ ga^ whose error is e^ly detected should be 
im diagram; ^imtifie dlscov^ is in part a trial-and-error proee^ 
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and such a process cannot occur without erroneous as well as suc- 
cessful trials* On the other hand, to employ a methodology by 
which it is impossible readily to detect a mistake once made, or 
deliberately to hide a possible mistake behind weasel words, philo- 
sophical fog, and anthropomorphic prejudice, slows the trial-and- 
error process, and so retards scientific progress. 

It is to be hoped that as the years go by, systematic treatises 
on the different aspects of the behavior sciences will appear. One 
of the first of these would naturally present a general theory of 
individual behavior; another, a general theory of social behavior. 
In the elaboration of various subdivisions and combinations of 
these volumes there would develop a systematic series of theoretical 
works dealing with different specialized aspects of mammalian be- 
havior, particularly the behavior of human organisms. Such a 
development would include volumes devoted to the theorj^ of skills 
and their acquisition; of communicational S3T]Qbolism or language 
(semantics) ; of the use of s:^Tnboiism in individual problem solu- 
tion involving thought and reasoning; of social or ritualistic sym- 
bolism; of economic values and valuation; of moral values and 
valuation; of aesthetic values and valuation; of familial behavior; 
of individual adaptive efficiency (intelligence) ; of the formal ^u- 
cative processes; of psychogenic disorders; of social control and 
delinquency; of character and personality; of culture and accul- 
toation; of magic and religious practices; of custom, law, and 
jurisprudence; of politics and government; and of many other 
^ecialized behavior fields. 

As a culmination of the whole there would finally appe^ a 
work consisting chiefly of mathematics and mathematieal Ic^c* 
This would set out with a list of undefined terms or signs whc^ 
referents are publicly available to the observation of all normrf 
persons; such terms, because they can be directly conditioned to 
the referents by differential reinforcement, should have a minimum 
of ambiguity* From these undefined notions would be s3mtheriE6d 
by the incomparable technique of s3?mbolie logic all the critical 
concepts required by the system, for correct primary concepts are 
just as important for valid systematization in lienee as are corr^t 
primary principles; this should yield a complete set of wholly un- 
ambiguous terms. From these terms or signs would be formuiatoi 
precise mathematical statements of the several pcstuiates or pri- 
mary molar principle which survive "Sie intervening winnowing 
proc^^, together with such other principle as it may be found 
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necessary to introduce; from these, by means of rigorous mathe- 
matical processes, would be derived theorems paralleling all the 
empirical ramifications of the so-called social sciences. Also there 
should be derivable large numbers of theorems concerning the out- 
come of situations never yet investigated; this latter group would 
make possible practical behavior applications and social inventions. 

If one may judge by the history of the older sciences, it will 
be a long time before the “social” sciences attain a status closely 
approximating that contemplated here. Nevertheless there is rea- 
son to hope that the next hundred years will see an -unprecedented 
development in this field. One reason for optimism in this respect 
lies in the increasing tendency, at least among Americans, to regard 
the “social” or behavioral sciences as genuine natural sciences 
rather than as Oeisteswissenschaft. Closely allied to this tendency 
is the growing practice of excluding theological, folk, and anthropo- 
morphic considerations from the list of the presumptive primary 
behavioral explanatory factors. Wholly congruent with these ten- 
dencies is the expanding recognition of the desirability in the 
behavior sciences of explicit and exact systematic formulation, 
with empirical verification at every possible point. If these three 
tendencies continue to increase, as seems likely, there is good reason 
to hope that the beha-vioral sciences will presently display a devel- 
opment comparable to that manifested by the physical sciences in 
the age of Copernicus, Kepler, Galileo, and Newton. 

But we should not deceive ourselves. The task of systemati- 
cally developing the beha-vior sciences will be both arduous and 
exacting, and many radical changes must occur. Behavior scien- 
■(asts must not only learn to read mathematics understandingly — 
they must learn to think in terms of equations and the higher 
mathematics. The so-called social sciences will no longer be a 
division of belles lettres; anthropomorphic intuition and a brilliant 
style, desirable as they are, will no longer sufiBce as in the days 
of W^illiam James and James Horton Cooley. Progress in this 
new era will consist in the laborious "writing, one by one, of hun- 
dreds of equa-tions; in the experimental determination, one by one, 
of hundreds of the empirical constants contained in the equations; 
in the de-vising of practically usable uni-ts in which to measure the 
quantitira expr^sed by the equations; in the objective definition 
of hundreds of symbols appearing in the equations; in the rigorous 
deduetiosi, one by one, of thousands of theorems and corollaries 
bpom the pimaiy definitions and equations; in the meticulous per- 
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formance of thousands of critical quantitative experiments and 
field investigations designed with imagination^ sagacity, and daring 
to test simultaneously the validity of both the theorems and the 
primary principles and concepts from which the former have been 
derived; in the ruthless discard or revision of once promising pri- 
mary principles or concepts which have failed wholly or in part 
to meet the test of empirical validation. 

There will be encountered vituperative opposition from those 
who cannot or will not think in terms of mathematics, from those 
who prefer to have their scientific pictures artistically out of focus, 
from those who are apprehensive of the ultimate exposure of cer- 
tain personally cherished superstitions and magical practices, and 
from those who are associated with institutions whose vested inter- 
ests may be fancied as endangered. 

This great task can be no more than begun by the present 
generation of workers. Hope lies, as always, in the oncoming 
youth, those now in training and those to be trained in the future. 
Upon them rests the burden of the grinding and often thankl^ 
labor involved, and to them must rightfully go the thrill of intel- 
lectual adventure and the credit for scientific achievement. Per- 
haps they will have the satisfaction of creating a new and better 
world, one in which, among other things, there will be a really 
effective and universal moral education. The present work is pri- 
marily addressed to them. 
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Nde: The literal signs are arranged primarily in the alphabetic^ order 
of the major letter co^tituting the sign, and secondarily according to die 
subscripts. The non-literal s%ns are grouped at the end "of the list. 

A = amplitude, magnitude, or intensity of a reaction; A = Ysfeg - 

a = empirical eiqponential constant in the equation expiring die 
^neralized or effective habit strength as a function of sEg and 
d, i.e., 

sfffi = sHsT'^. 


d = empirical constant in the ecpiation, d-R 


sEJ-' 


B = empirical constant in the equation, is = 

B — W 

of 

V = empirical constant in the equation, — , 

jOj, = temporal coincidence of a receptor impulse (§) and the b^inning 
of a reaction impulse (r). 


c 

Cd 


empirical constant in the equation, Ir 


cn 

-F’ 


conditions which produce the drive (D), the obiective conditions 
from which D may be calculated. 


D = strength of dominant primary drive operative in tlie i^ioiary 
motivation to action after the formation of the habit invdved. 


ly = stren^h of primary drive (D) operative during the formatim d 
a habit. 

D = the pint strength of all the non-dominant drives active at a grra 
moment. 


D + Md 


d = the number of j.n.d."s lying between the two stimulus 
S and jS. 


D-D. 

e = a mathematical constant properly having a value d 2.7183 Imt 
here frequently given the value of 10 because more convenient 
in u^ where Ic^arithms are involved. 
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= excitatory potential, potentiality of reaction evocation; i.e* 
D + D 
jD Md 

sEr = effective reaction potential, i.e., sEr— sEJr — Ir- 


S^R — Si -i- X -r- 


= momentary effective reaction potential; as modified by 

F = the constant factor of reduction of the unrealized potential habit 
strength under given learning conditions. Thus if at each rein- 
forcement the unrealized potentiality of habit strength is reduced 
by 1/10, F has a value of .1. 


F' = amount of force. 


/ = an unstated quantitative functional relationship, e.g., 

A = .141 sHb + 3.1 may be written, A = i.e., A is a, 

function of 

f = empirical constant in the equation, n = c' sEr — 

G = a. need reduction or a stimulus which has been closely associated 
with a need reduction; primary reinforcement; also a primary 
goal reaction. 

g = fractional portion of a goal reaction which may be split off from 
G and carried forward in a behavior sequence as a fractional ante- 
dating goal reaction. 


sHr — habit strength conceived as a rough or approximate stimulus- 
response relationship to sSf 

sE ie == the habit strength (sE r) which results from N reinforcements. 


JEj. = habit strength conceived as a precise dynamic relationship be- 
tween afferent and efferent neural impulses. 


ArEr = increment of habit, strength resulting from a single reinforcement. 
sEr = effective habit strength: rEr^ rEret^'^^ 


ssSr = %+ s^R, i-^-) the result of the summation of the habit strengths 
asociated with two or more stimulus elements or 


k = number of hours, as of food privation. 

¥ = anpiri<^ constant in the equation, A = ¥sEr — z'. 

== ^nount of reaction inhibition. 

Is = totel amount of inhibitory potential, i.e., Ir - Ib+ rIr* 
Jr = amount of conditioned inhibitory potential. 
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= amount of reactive mhibition t units of time after a given sequence 
of reaction evocations, i.e., 

-IrX 

• tlie increment of reactive inhibition generated at a sin^e reaction 
evocation, 

the exponential constant in a learning situation where = 
m — me ^ . The quantitative value of i is given by the equation. 


' = empirical constant in the equation, A == k'sEjt — f'. 

! = empirical constant in the equation expressing the TnaYimnm habit 
strength as limited by the delay in reinforcement (0 ; 

m' = 

= empirical constant in the equation expiring the stimulus gener- 
alization of habit strength, 

sEr = 

= discrimination threshold — ^the distance on a generalization 
tinuum between two stimuK which, at the limit of practice, can be 
reacted to differentially on 75 per cent of the trials. 

== empirical constant in the equation expr^sing the maximi iTn habit 
strength as limited by the quality and quantity of the reinforck^ 
agent employed per reinforcement, i.e., 

M' = M(1 - 

= distance (length) of movement. 

: = reaction threshold, the minimal amount of effective imctiom 
potential (sEr) (the effect of c^illation being at a minimum^ 
that will mediate reaction evocation, 

= logarithm. 

= the physiological maximum of habit strength attemable uncter 
optimal conditions. This value is taken as 100 h^3s. 

= the mayininTTY of habit strength with unlimited practice as limits 
by the amount and quality of the reinforcing agent, i.e., 

M' = Mil - 10-^). 

= the physiological maximum of drive (100 mobs), 

== the physiological maximum of reactive mhibition (KX) pavs). 
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m = the physiological maximiim of habit strength, attainable with ud- 
limited^practice, as limited by the asynchronism of S and 22 in the 
reinforcement situation, e.g., 

m = me or m = me “ . 

m' == the maximum of habit strength, with unlimited practice, as limited 
by the delay in reinforcement, i.e., 

m' = M 

"N == the number of reinforcements. 

n == the number of unreinforced reactions required to produce experi- 
mental extinction, i.e., 

c 

0 = a function of sOs such that 0 = sOs when but 

O = sHfiWhen sO^ > 

== the oscillatory weakening potentiality associated with effective 
reaction potential 

P ss coefficient of observable patterning, i.e., P = Q — Q. 

= theoretical index of patterning, i.e., P' » lOO — 

p = probability of reaction evocation. 

Q = per cent of empirical reaction evocations by the positive or rein- 
forced phase of a stimulus compound. 

Q — per cent of empirical reaction evocations by the negative or non- 
remforced phase of a stimulus compound. 

= theoretical effective reaction potential of the negative portion of 
a stimulus patterning situation. 

Q = th^retical effective reaction potential of the positive portion of a 
stimulus patterning situation. 

q == empirical exponential constant in the equation of the dissipation 
of/^ X 

P = (1) reaction or response in general (muscular, glandular, electri- 
cal); 

(2) more specifically, the reaction which occurs as the result of pre- 
vkms conditioning. 

P = a reacticm whidi ^ in the pitx^^ of being conditioned to a 
s^hnulus. 

^ ss: response such as the flow of saliva in a Pavlovian 

<50ffiditicmed r^fex experiment. 
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= response to a conditioiied stimuliis before mndfi f mnTTig b^ins* 

(S) = a 2 :^ 5 K>nse wMcb. is either unobservable or exc^^^iingiy f^Efble and 
difScidt of observation. 

S == 1. stimulus energy in general, e.g., the energy of sound, Hght, or 
heat wav^, pr^ure, eto. 

2. more sj^ificaily, stimulus energy T^hich evok^ a r^ponse on 
the b^is of a previously formed habit. 

Sa ~ stimulation arising from apparatus employ^ in an eiperimeat. 

Sd = drive stimulus, Le., stimulation arising from a condition of n^ 
or disequilibrium. 

Sc = conditioned stimulus such as the buzsjer in a Pavlovian <x>n<ii- 
tioned-refiex experiment. 

Su = unconditioned stimulus such as Ibe food of the Pavlovian condi- 
tioned-reflex experiment. 

jS = a stimulus when considered as in the proc^ of being a>nditK>i^ 
to a reaction. 

s = afferent neural impulse resulting from lie action of a stimulus 
energy on a receptor, os 

= drive stimulus receptor discharge. 

Sc = afferent impulse arising from tiie action of a conditioned stimulus 
on a receptor. 

8a = afferent impulse arising from the action on a receptor of a stimulus 
energy arising from an apparatus employed in a learning situatkm. 

s = an afferent neural impulse wh^ considered as in the pnx^ erf 
being conditioned to a reaction. 

S = an afferent neural impulse as modified by afferent neural inter- 
action. 

A(s > r) or A(S >R) = increnent to a receptor-effecfcmr cchsb^^iu 

T = the time at which an instantaneous event o<X5urs. 

Tc = the time of the banning of the jR of a gC^ 

Tb = the time of the b^mning of a reaction. 

Ts = the time of the hymning of a stimulus which is in of 

being conditioned. 

Tq = the time at which a renforc^n^t occurs. 

t = (1) time in the sense of duration; 

(2) the duration of the delay in r^nforcemeit, Tc ^ Tq. 
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ff ^ Ts — .66, when S acts continuously and overlaps the begm- 
ning of where Tr and Ts are given in seconds. 

t" = Tr — Ts — .44, when S and R are practically instantaneous, 
where Tr and Ts are given in seconds. 

t"' = the duration in minutes following a sequence of unreinforced 
evocations of R during which neither reinforced nor unreinforced 
evocations of R have occurred. 

= the latency of a reaction evocation, the time intervening between 
the beginning of the stimulus and the beginning of the response. 

sUr = unlearned or native receptor-effector reaction potential. 

sUr = momentary unlearned reaction potential. 

u = empirical exponential constant in the equation expressing the 
maximum habit strength as limited by the time a stimulus (S) 
has been continuously acting when R occurs, i.e., 

m = 

V = empirical exponential constant in the equation expressing the 
maximum habit strength attainable with unlimited reinforcement 
as limited by the degree of S—R asynchronism, i.e., 

m = 

W = the amount of work, i.e., W = F'L. 

w = the magnitude of a reinforcing agent employed as in the equation, 
M' = ikf(l - e-^). 

A = increment, e.g., ^s^r. 


e = used in mathematical logic and read as e.g., xeS is read, 
is 


S = tbe sum of a ^ri^, as HiAsSr, which means the sum of the incre- 
ments of a habit strength resulting from a series of reinforcements. 

^ s^B “ standard deviation of the oscillation of reaction potential. 

= tibe standard probability function. 





• = a sogn u^ in mathematical Ic^c meaning “and.^^ 

3 = a s%n of implication used in mathematical logic and read, '^if , 

; e.g., xZD y is read, *'If Xj then y,” i.e., x implies y. 
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a = a sign used in mathematical io^c, e.g., (3z), and r^, is 

an X such that • . . 


+ = physiological summation, e.g., sSsi4- sHsf = 

8^Mi + ““ 


sRmi X 

ioo 


> =: greater than, e.g., 5 > 4. 
< = less than, e.g., 4 < 5. 


— > = a causal receptor-effector relationship inherits or at hmt in 
functional condition at the outset of a learning situation. 


> = a causal receptor-effector relationdiip which is acquire by tim 

organism. 

= a causal relationship other than that of a receptor-eff^tor coni^- 
tion. 
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of, 165 f. 

Configuration, 44, 48 f. 

Consequent conditions: in behavior 
theory, 382 f. 

Cyclic-phase conditioned reaction, 
165 f. 


Delay of reinforcement: affecting 
preference for act, 146 f.; Ander- 
son’s study, 148 f.; corollaries, 147 f., 
151 f., 154, 157; early attacks on 
prc^lem, 136 f.; and habit strength, 
135 f.; Hamilton’s study, 136 f,; 
origin of problem, 135 f.; Perm’s 
experiment, 139 f., 161 f.; rate of 
di^nimination and, 152 f.; reconcili- 
ation of experimental paradoxes, 
140 f.; relation to Weber’s Law, 
154 f.; Warden and Haas’ study, 
1^; Watson’s study, 136; Wolfe’s 
eaq^riment, 138, 160 f.; Yoriiioka’s 
study, 153 f. 

Detainination of action: Elliott’s 
study, 231 f.; Perm’s study, 227 f.; 
rote of drive in, 226 f.; role of habit 
strengtii in, 226 f.; Stone’s expm- 
ment,231; Williams’ study, 227 f. 

Differen^d reactions: corollary of, 
^1; Hull’s inv^gation, 233 f.; to 
kientilcal ^vironm^tal sibmtions, 
233 f.; Leer’s study, 234i. 

Kffirarential reinforcement: and n^a- 
Mve pat^ming, 363 f.; and positive 


Disinhibition: corollaries, 288 f., 293; 
and curve of extinction, 291 f,; of 
extinction effects, 272 f.; Hovland’s 
study, 291 f.; inhibition and, 392; 
phenomenon of, 287 f.; simultane- 
ous, 273; Switzer’s study, 291; Za- 
vadsky’s study, 272 f. 

Doctrine: of “emergentism,” 26 f. 

Drives: aspect of motivation, 131; as 
intervening variables, 57 L, 66 f.; 
and need, 57 f.; Postulate VIII, 
300; primary, 59 f. 

Drugs: influence on experimental ex- 
tinction, 236 f.; Miller and Miles’ 
investigation, 237; Skinner and 
Heron’s study, 237 ; Switzer’s study, 
236 


Effective habit strength: concept of, 
1871; Major Corollary 11, 253; os- 
cillation of, 308 f., and probability 
of reaction evocation, 308 f.; and 
reaction-evocation power, 210 f. 
Effective reaction potential: ampli- 
tude of reaction and, 339 f.; concept 
of, 2811; corollaries, 2841, 2891, 
297; as function of reaction latency, 
3361; incompatible reaction poten- 
tials, 341; and inhibition, 277 f.; 
Postulates XII, XIV, and XVI, 
344; and reaction evocation, 326 f.; 
and reaction potential, 226 f., 390 1; 
related to reaction-evocation prob- 
ability, 3261; resistance to experi- 
mental extinction, 337 f , 

Effector: and adaptation, 3841; dis- 
charge, 72 

“Emergentism”: meaning of, 261 
Empirical reaction threshold : in con- 
ditioning, 324 f.; Hill’s study, 3241; 
individual demonstrations of, 3241 
Entelechy, 258; Driesch’s, 23, 28 
Environment: external, 16; inani- 
mate, 16; internal, 16; organismic, 
16 

Equations: 1, 119; 2, 119; 3, 120; 4, 
120; 5, 6, 7, 8, 9, 10, 121; 11, 134; 
12, 134; 13, 160; 14, 162; 15, 163; 
16, 178; 17, 18, 19, 20, 179; 21, IS, 
23, 1^; 24, 25, 26, 27, 28, 181; 
199; 30, 200 ; 31, 201 f.; 33, 

34, 254 ; 35, 255 ; 36, 37, 38, 39, 40, 
41, 42, 300; 43, 301; 44, 319; 45, 



417 


INDEX OF SUBJECTS 


320; 46, 344 ; 47, 4S, 40, 51, 345; 

52, 53, 355 

Excitation gradient: interacting with 
extinction gradient, 265 f. 

Experimental extinction: corollaries, 
287, 203; as corrective mechanism, 
^2; disinhibition and, 272 f., 291 f.; 
Eiison's study, 270 f., 275 f.; exam- 
ples of, 259 f. ; as function of elec- 
tive reaction potential, 337 L, 347; 
as function of unreinforced reac- 
tions, 260 f.; habit strength and 
resistance to, 106 f.; Eovland’s 
studies, 260 f., 263 f., 268 f„ 270 f., 
275; influence of drugs, 236 f.; and 
inhibition, 391 f.; interaction of gra- 
dients, 265 f.; motivational status, 
391; Pavlovas studies, 259 f., 

270; perseverationai effects of, 
2^f.; Postulate XIY, 344; reaction 
generalization of, 267 f . ; and remi- 
niscence, 391 f.; as secondaiy phe- 
nomenon of reactive inhibition, 
277 f 391 f . ; spontaneous recovery 
in, ^9 f 391 f . ; stimulus generali- 
zation of, 262 f.; and unadaptive 
habits, 258 f.; Williams’ study, 
106 f. ; Zavadsfe^’s study, 272 f. 

Extinction, 88 (see aUo Experimental 
extinction) ; experimental, 106 f., 
2^f., 258 f.; generalized, 101; of 
secondary reinforcement, 90 f . ; 100 f . 

‘Tirst order” conditioned reflex, 85 

Fractional component: of goal r^to- 
tion, lOQ 

Frequency; of impul^s, 41 f. 

Functional autonomy: as self-rein- 
forcement, 101 

Functional dynamics: of compound 
conditioned stimuli, 204 f. 

Galvanic skin reaction, 103 f.; Hol- 
land’s study, 260 f. 

Generalization : and behavioral varia- 
bility, 321; of extinction effects, 
262 f.; and positive pattemi^, 
366 f.; response, 183; re^nse in- 
tensity, 316; stimulus, l^f^ 216 f., 
^9f.; stimulus-re^onse, 183 

Goal, 25, 95; attainment, 26; gradi- 
ent, 1(W, 145; as reinforcing 
state of affairs, 


Goal gradient hypothesis, 145; equa- 
tion 14, 162 ; formulation of, 142 f- ; 
and gradient of reinforcement, 1^; 
and habit increment, ^7 f. 

Gradient: anterior stimulus a^mchro- 
nism, 172, 1^; of excitation, 
of extinction, ^f.; of genemlim- 
tion, 187; goal, M, 100, 142 L, 145, 
160, 162; of habit strength, 173; 
posterior - stimulus asynchrosism, 
171, 180; of reinforcement, 94, i^f., 
159 f., 162, m 

Growth constant, 114, 119, 127, 129 

Growth function; negative, 145; p<^- 

tive, 104, 3^, 335 

Hab: computation of, 1191^ 1^; de- 
fined, 114 

Habit, 21, 74, 109; defined, 102; for- 
mation, 102, 204 f,; increment, 

387 f.; in learning, 387; and reit> 
forcement, 2^; strength, ICBf.; 
summation, 212 L; unadaptive, 258 f. 

Habit formation: amount of rein- 
forcing agent and cur\’e of, 127 f.; 
complexity of stimulus, 
working hypothesis of, 12S 

Habit increment per reinforcement: 
conditions influencing, 387 f.; goal 
gradient, 388; gradient of reinforce- 
ment, 388; primaiy reinforcing 
agent, 388; secondary reinforce- 
ment, stimulus-response asyn- 
chronism, 388 

Habit strength : acquired, ^ f. ; con- 
cept of, K^f.; condition, 1^; de- 
lay and reinfofement and, 135 f.; 
distribution of, duration of 

conditioned stimulus and, 165 f.; 
equations, 222 f.; as function of 
amount and nature of remforcing 
agent, 124 f.; as function of num- 
ber of reinforcements, 192 h, 112 
387; how to compute, 119!.; how 
to compute increment of, ; Kap- 
pauf and Schl{^>e3:g’s study, 165!.; 
loading, 207 f . ; |^r cent wfwt re- 
action evocation, If^f.; Puaiulate 
IV, 178; and primaiy motivation, 
390; qualification m quantitative 
^ientific eoi^ruet, 1^; in qmati- 
tative derivaticm of reaction poltn- 
tkl, 242 smd latency,^ 

104 f,; 
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103 1; and reaction potential corol- 
lary, 247; and resistance to experi- 
mental extinction, 106 f,; and stim- 
ulus drive corollaries, 248; stimulus 
generalization, 389; stimulus -re- 
sponse asynclironism, 165 f.; stimu- 
lus trace, 169 f.; symbolic represen- 
tation of, lllf.; theoretical curve 
of growth, 1141; Wolfe’s experi- 
ment, 1701 

Habit summation: corollaries, 2141; 
difficulty of applying equations, 
2231; principle of, 2121 
‘TEQgher order” conditioned reaction, 
85, 90; possibility of, 93 1 
Humphrey’s arpeggio paradox; confu- 
sion about, 374; resolved, 3721 
Hypothesis: Kauppauf - Schlosberg, 
168; Mowrer-Miller, 278; of neural 
interaction, 42 f.; of primary moti- 
vation, 2^1; of stimulus trace, 42 

Incentive: and amoimt of reinforcing 
agent, 131 f.; aspect of motivation, 
131 ; concept of, 131 ; Fletcher’s ex- 
periment on, 132; as secondary mo- 
tivation, 226 

Inhibition: a^ociative, 341; condi- 
tioned, 392; corollaries, 2^1; and 
disinhibition, 392; and effective re- 
action potential, 277 1 ; and experi- 
mental extinction, 3911; motiva- 
tional status, 391 f.; Postulates VIII 
and rX, 300; reaction potential, 
^1; reactive, 327, 391 1; of rein- 
forcement, 2891; reminiscence and, 
^11; and ^mntaneous recovery, 
^11 

Inhibitoy potential (see also Inhibi- 
tion): conditioning of, 2811; cor- 
ollaries, 2821, 288; and effective 
reaction potoitial, 281 f.; as logical 
coi^^ruci, 278; Mowrer and Jones’ 
^udy, 2791; Pavlov’s study, 2^1; 
Postulates Vm and IX, 300; prin- 
ciples related to, 277 1 ; problems 
c^ditionmg of, ^1 f.; quantita- 
tive of relationship 

of “wcrk” to, 

genaalh^tion, 2811; imiitsf, 2^1 
Hmate b^iavior: (ffiaractw^cs of. 
Bit; ami ^teicles towwd varia- 
,^1 


Intensity: of reaction, 3391 
Intents, 25 

Interaction: environmental - organ- 
ismic, 16 f 25 1 

Intervening variables: role of, 21 f 
23, 30, 57 f, 

iji.d., 190 

Kinaesthesis, 35 

Latency: of optic nerve fiber, 40; of 
reaction, 1041, 3361 
Law: all-or-none, 51 f.; of "effect,” 
78, 135 f.; of gravitation, 41, 11; of 
"least action,” 294; of “less work,” 
2931; of “minimal effort,” 294; of 
primary reinforcement, 71, 206; of 
probability, 160, 306, 3101, 316 f, 
328; of “recency,” 135; of rein- 
forcement, 98, 206, 258; of spon- 
taneous recovery, 284; Weber’s, 
1541 

Learning: adaptation in, 3861; baric 
curve of, 116; conditioned reflex a 
special case of, 76 f., 386; discrimi- 
nation, 2651; equations of, 389; 
equations fitted to curves of, 120 f.; 
example of, 70 f.; general nature of, 
681; and habit, 387; and latency 
of reaction, 110; and magnitude of 
reaction, 110; primitive trial and 
error, 386; and reinforcement, 
3861; selective, 701, 761; theo- 
retical curve of, 117 
Lhnen, 324 

Logical constructs, 211, 23, 113; of 
inhibitory potential, 276 f.; use of, 
111 

Magnitude: Hovland’s study, 1031; 
of reaction, 1031 

Major corollaries: 1, 199; H, 253; III, 
319; IV, 378; V, 379 
Maladaptive behavior, 25 
Mechanism: defined, 384 
Molar analysis, 112 
Molar behavior: contrasted with mo- 
lecular behavior, 201; explanation 
of, 17; task of, 19 
Molecular behavior, 201 
Momeafauy effective reaction poten- 
iaal, 3131; Postulate XI, 344; Pos- 
tulate XIH, 344; Postolate XV, 
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344; Postulate XVI, 344; mathe- 
matical statement- of postulates, 
344 f. 

Monotonic habit-reaction relation- 
ship: corollary, 215; principle of, 
212 f. 

Mote: unit of strength of primary 
drive, 238 

Motivated activity, 60 f.; aspects of, 
131; Pichter^s studio on hunger 
and sex drive, 62 f.; thirst, 60 f.; 
Wada^s hunger study, 61 f.; Wang's 
sex study, 63 

Motivation; and incentives, 226; pri- 
mary, 226 f.; secondary, 2^ 

Motor end-plate, 51 

Mowrer-Miiler hypothesis: statement 
of, 278; submolar principle from, 
278 f. 

Muscles: action of, 50 f.; all-or-none 
law, 51 f.; unlearned coordination 
of, 53 f. 

Keed: and drives, 57 f.; modal reac- 
tions to, 59 f,; or organian, 17 f.; 
reduction, 2^ 

Negative growth function: goal gra^ 
dient, 145; and habit strength, 145 

Negative patterning: Corollary IV, 
365; derived, 363 f.; by differential 
reinforcement, 363 f.; of simultane- 
ous stimulus compoimds, 363 f. 

Neural interaction (see also Afferent 
neural interaction): and configura- 
tion psychologies, 48 f.; hypothesis 
of, 42 f.; Pavlov's statement of, 
47 f.; Posenblueth’s study, 42 f. 

Neurological approach, 19 f. 

Normal law of probability: and be- 
havioral oscillation, 306, 310 f.; in 
behavioral sciences, 316 f.; and dis- 
tribution of o^niatory force, 314; 
equation of, 320; graphic represen- 
tation, 312; and magnitude of mus- 
cle contraction, 310; the ogive, 313; 
Peculate Xn, 344 

Objectivism : in behavior tiieoiy, 30; 
vs. teleology, 24 f. 

Operational ddSnition: Bridgman, 30 

Organisms: adaptive activity of, 
^4f.; survival of, 32 f.; as 
lYiamtaining mechanians, ^4f, 
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Oscillation, 45; coroHary, 
taneous, 149 f. 

Pattern, 44 

Patterning (see also N^a-tive pat- 
terning, Positive patteming. Si- 
multaneous stimulus patterning, 
Spontaneous stimulus patterning, 
Stimulus patterns, and Temporal 
stimulus patterning) ; corollaries of, 
358 f^ 363, 365, MS, 371 f.; empirical 
patterning coefficient, 3^; equa- 
tions of patterning coefficient, 355, 
379; functional dynami?^ of, 374 f,, 
^f.; and GestaU j^’chol£^% 
379 f.; Humphrey's arpe^o fmm- 
dox resolved, 372 f.; Major Corol- 
lary IV, 378; Major Corolbry ¥, 
379; ne^tive, 350, 353, 363 f., m L; 
Pavlov’s work, 350 f.; phyriolc^ical 
summation in, 397; positive, 350, 
352, 360 L; principles involv^ in, 
354 f . ; simultaneous stimuiis, 350 f . ; 
spontaneous, 356 f.; temporal stim- 
iilus, 350, 368 f.; riieoretical pat- 
terning coefficient, 355; Woodtoy’s 
study, 351 f. 

Pav: unit of inhibitoiy potmtial, 
280 f. 

Perseveration: Lorente de No’s study, 
42; nature of, 41 f.; mr reverb^a- 
tion, 42 

Perserverative i^iinuli^ tnw^; and 
adaptation, 385; and aekiptive be- 
havior, 385 

Persevemtive brace, 100, 71; and toa- 
p<u*al stimuli^ irnttarnic^, 371 

PhyBiol<^c»l: maximum, 114; mh- 
mation, 102, ^f^ 3Sff 

Positive growth fancriem, 229 3^, 

335; b^c principle of, 114; and 
theoretical curve in 
formation, 114 f. 

Postive patterning; Corolkry HI, 
36S; Coroikiy V, 3^; with de- 
creased gei^ralimticm, i^f.; <te- 
rived, 360 f.; by differential rein- 
foixsement, 360 f.; of simuita3^£^ 
stimulus (mmpounds^ 3601. 

Posterior stimulus 
dient, 171 

PostuWtef: m primary principlw, 2 
26; I, 47; 11, 47; HI, 66; TV, ITS; 
V, 1^; VI and VII, 253; VHI &i 
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IX, 30); X, 319; XI, XII, XIII, 
XTV, XV, and^ XVI, 344 

Primary motivation: concept of re- 
action potential, 239 f.; concept of 
strength of primary drive, 238 f.; 
differential reactions to identical 
situations, 233 f.; drive as a factor 
in, 226 f.; effect of sex hormones, 
237 f.; and extinction, 391; habit 
strength as factor in, 226 f., 390; 
influence of drugs, 236 f.; Major 
Corollary 11, 253 f.; mediation of 
behavior, 390; Postulates VI and 
VII, 253 f.; and reaction potential, 
226 f., 390; stimulus-intensity gen- 
eralization applied to drive, 235 f.; 
as symbolic construct, 390; twelve 
corollaries of, 247 f. 

Primary reinforcement, 68 f.; in con- 
ditioning, 76; critical factor in, 81; 
Finan^s study, 81 f.; law of, 71, 206; 
learning, a process of, 71 ; in stimu- 
lus compound, 206; and Thorn- 
dike’s ^flaw of effect,” 80. 

Primary stimulus generalization: cor- 
ollary, 219; principle of, 216 f.; 
Shepard and Fogelsanger’s study, 
219 

Primary stimulus-intensity generali- 
zation: applied to drive stimulus, 
^f. 

Principle: of afferent neural interac- 
tion, 47 f., 216 f.; and axioms, 2; of 
habit summation, 212 f.; molar, 20; 
molecular, 2^ ; of monotonic habit- 
reaction relationships, 212 f.; as 
postulates, 2; primary, 2f., 25; of 
primary stimulus-intensity generali- 
zation, 216 f-, 235 f.; secondary, 2f*, 
25; of stimulation, 39 

Probability, 262 

Quasi-patterning. See Spontaneous 
patterning 

It^fcetion: amplitude, 339 f., 344, 347; 
different, 233 f.; effect of drugs, 
2^f.; evocatim, 107 f.; galvanic 
dkin, IC^f.; generalization, 267 f.; 
intei^ty, 339 f.; latency, 104 f., 
336 f., 344, 3^; m^nitude, K^f,; 
poteitial, 226 f.; threshold, 322 f., 

m 


Reaction evocation : Bertha lutzi 
Hull’s study on habit strength and 
per cent of, 107 f.; and compound 
stimulus aggregates, 209 f.; corollar- 
ies, 328, 333 f.; derivation of, 328 f.; 
effective habit strength and, 210 f.; 
and effective reaction potential, 
326 f.; learning curves of, 330 f., 
346; paradox of, 314 f.; probability 
of, 308 f., 326 f.; and reaction po- 
tential, 394 f.; and reaction thr^- 
old, 3261, 3941 

Reaction-evocation probability : and 
behavioral oscillation, 308 f.; deri- 
vation of, 3281; relation to effec- 
tive reaction potential, 326 f., 3941 
Reaction generalization: Ellson’s 
study, 2681; of extinction effects, 
2671 

Reaction potential: corollaries, 2471, 
2831, 287, 289, 296; definition of, 
2391; effective, 2811, 3921; and 
forgetting, 395; incompatible, 341; 
and inhibition, 392 f.; Major Cor- 
ollary II, 253; Postulate VIE, 253 1; 
Postulate Vm, 300; and primary 
motivation, 2261, 390; as primary 
motivational concept, 238 f.; quan- 
titative derivation of, 2421; and 
reaction threshold, 394 f.; response 
evocation, 3941; successive extinc- 
tion of same, 2861; units of, 239 
Reaction threshold: in conditioning, 
324 f . ; chronaxie determinations, 
324; defined, 324, 3941; empirical, 
324; Hill’s study, 3241; and num- 
ber of reinforcements, 394 f.; oscil- 
lation components ol 3451; and 
oscillation function, 3251; postu- 
lates of, 344; reaction potential, 
3941; and response evocation, 
3221, 3941; true or initial, 325 
Reactive inhibition, 327, 391 f.; ex- 
perimental extinction arising from, 
2771; and Mowrer-Miller h 3 ^oth- 
esis, 2/8 

Receptor: and adaptation, 3841; 
analysis of environmental energies, 
401; di^harge, 40 1, 71 1, 206; q>e- 
cialized, 18; types of, 331 
Receptor-effector connections: acqui- 
sition of new, 681, 841; in condi- 
tioning, 76; innate, 691; new, 731; 
strengthening of innate, 68 f., 72 
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Heceptor-e^ector convergence, 1S4, 
192 

Refies; chains, 55 f.; heterogenenous, 
26S; homogeneous, 26S; stepping, 
54f. 

Reification, 2S 

Reinforcement : corollaries, 291 , 2^ f . ; 
differential, 360 f., 363 f.; distrib- 
uted, 295 1 .; gradient of, 94; habit 
formation and delay of, 135 f.; 
and habit increment, 3S7f.; habit 
strength and number of, 112 f., 387; 
inhibition of, 260 f., 2S9f.; law of, 
98; and learning, 386 L; primary, 
68 f.; secondary, Mf., 387 

Reinforcing agent: Cowles’ study, 
SQL; Grindley’s experiment, 91 f,; 
habit strength as a function of na- 
ture and amount of, 124 f.; food 
reward as, 98 f.; possible identity 
of proceises, 99 f.; primary, 89, 388; 
problem of incentive and amount 
of, 131 f., 134; secondary, 85, 86, 
89 f., 99 

Reinforcing factor: onset or termina- 
tion of need-receptor impulse, 82 f. 

Reminiscence: Calvin’s study, 296; 
corollaries, 296; and distributed 
reinforcements, 295 f.; and extinc- 
tion, 391 f.; inhibition and, 3911; 
phenomenon of, 2951 

Reverberation, 42 ; of neural im- 
pulses, 41 f.; and perseveration, 
41 1 

Reward: as incentive, 131 

Robot: use of, 27 

Science : empirical aspects, 1 f . ; theo- 
retical aspects, 1 1 

Scientific theory: anthropomorphism 
in, 382 ; aspects of science, 1 f 381 ; 
constituting a logical hierarchy, 5 f., 
3811; deductive nature of, 21; 
definition of, 21; differing from ar- 
gumentation, 7f.; the nature of, 
1 1, 381 1; Newton’s system, 7 ; and 
probability, 10, 12; and ^mpling, 
10, 12; substantiation of postulates, 
121; theoretical and empirical con- 
tributions, 9f.; ''truth” status, 131 

"Second order” conditioned reaction, 
85, 90 

Secondary reinforcement: Bugelski’s 
study, 88; differential causal effi- 


cacy of, 891; existence of, 841; 
extinction of, ^1; Frolov’s study, 
S41; and habit increment, 3871; 
problems coneeming, M; reactions 
subject to, 871; role in compound 
selective learning, 951, 387; Skin- 
ners study, 87 f. 

Secondarv' stimulus genera lizat ion, 

191 1 

Selective learning, 701, 77; amount 
of reinforcing agent affecting rate 
of, 125 f.; compared with condi- 
tioned-reflex learning, 76 f.; Grind- 
ley’s experiment, 1251; Wolfe and 
Kaplona study, 127 
Sensitization, 211 

Sex hormones : Beach’s mnv'ey, 237 f . ; 
effect on motivation, ^1, ^7f.; 
Stone’s study, 231 
Sham feeding, 99 

Simple di^remination learning: re- 
milting from mteraetion of gradi- 
ents, ^1 

Simultaneous stimuli^ patterning 
(see aho Patterning) : experimental 
examples of, 3501; Major Corol- 
lary IV, 378; Pavlov’s work, 3501; 
^ntaneou^ 3561; Woodbury’s 
Mudy, 3511 

Specialized receptors, IS 
Spontaneous di^harge, 59 
Spontaneous emi^ion: of neural im- 
pulses, 44 f., 310; Weiss’s investi^- 
tion, 45, 310 

Spontaneous oscillat ion : Blair and 
Erlanger's investigation, ^®1; of 
neural conductors, 3(01; as varia- 
bility in habit strength, 1491 
Spontaneous recover^’': eorollaric®, 
2S4f^ 287; Elison’s study, 2'^f^ 
275 f.; of extinction effects, ^01, 
3911; ineompletene^ of, ^1; 
and inhibition, Pavlov’a 

study, 270 

Spontaneous stimulus patterning: 
Corollary I, 3^1; Corollary II, 
359; derived, 356 f.; wdth increased 
neural intemction, 359 
Stimulation: principle of, ^ 
Stimulus: ag^gate, ^7, WG 1,3491; 
analy^ of environmental energi^, 
401; complexity in typical condi- 
tioning, 2041; compound, Wf^ 
3951 (Bee aho Patterning) ; dimen- 
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sion, 18Sf.; element, 207, 349 f.; 
generalization, 183 L, 216 f., 219; 
nature of, 32 f.; trace, 42 
Stimulus compounds (see also Pat- 
terning) : dynamics of, 395 f . ; physi- 
ological summation in, 395 f. 
Stimulus dimension: concept of, 
188 f.; Postulate V, 199 
Stimulus-evocation paradox, 194 f.; 
its resolution, 196 

Stimulus generalization: of condi- 
tioned inhibition, 281 f.; corollaries, 
285 f.; dimensions of, 389 f.; of 
extinction effects, 262 f.; and habit 
strength, 389; Hovland^s studies, 
184 f., l^f., 200 f.; incompleteness 
of, ^f.; Lumsdaine^s experiment, 
192 f.; Major Corollary I, 199; by 
means of identical stimulus com- 
ponents, 190 f.; mediation of behav- 
ior, 389; Postulate V, 199; primary 
stimulus intensity, 186 f.; primary 
stimulus quality, 184 f.; principle 
of, 216 f.; secondary, 191 f; SMp- 
ley’s experiment, 192; units of 
measurement, 189 f. 

Stimulus patterns (see also Pattern- 
ing): "calculus” of adaptive prob- 
ability, 375; compound trial and 
error, 375; functional dynamics of, 
374 f.; serial trial and error, 376 
Stimuius reception, 32 f.; of move- 
ment, 35 f.; of spatial relation^ps, 
^1; of temporal relationship, 39 
Stimulus-respon^ assmchronism : an- 
terior gradient, 172, 180; five cor- 
ollaries of, 168 f.; and habit incre- 
ment, 388; and habit strength, 
165 f,; a neurologirad hypothesis of, 
posterior gradient, 171, 180 
Stimulus taaee, 42; perseverative, 385 
Strength of drive stimulus: corol- 
kri^ of, 247 L; Heathers and 
Arakelian’s study, 235; Major Cor- 
<^Iary n, 253; physiologicd inter- 
pretarion, 240 f.; Pt^tulate VI, 
253 f.; as primary motivational con- 
238 f.; in quanritative deter- 
minaticm of leacUon poteitial, 
2121.; stimuhis-intenrity ga^eraii- 


zation applied to, 2351; unit of 
238 

Strivings, 25 
Subgoals, 90 

Subjectivism, 27; in behavior theory, 
30; prophylaxis against, 27 f. 
Survival: of organism, 17, 321 
Symbolic constructs: behavioral os- 
cillation, 393; in behavior theory, 
2821; habit strength, 1021; pri- 
mary motivation, 390; reaction 
potential, 239 f.; strength of pri- 
mary drive, 2381; summary of, 
383 

S 3 Tnbolic representation; of habit 
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